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Nucleic Acids Encoding [His^TIp^,Ty^^]-GnRH Preprohonnone and 
5 [Ser^-GnRH Preprohonnone and Their Uses 

BACKGROUND OF THE INVENTION 
This invention relates to DNA and protein 

10 compositions useful, for instance, in the regulation of 
reproductive function, and which are also useful in the 
development of transgenic animals with desirable reproductive 
characteristics. Specifically, this invention relates to DNA 
and protein compositions for the precursor protein for the 

15 chicken II form of gonadotropin-releasing hormone and to the 
tSer®] -GnRH preprohonnone. 

Gonadotropin-releasing hormone (GnRH) is an important 
reproductive hormone in vertebrates. GnRH is a neuropeptide 
which regulates the production of follicle stimulating hormone 

20 (FSH) and luteinizing hormone (LH) . In mammals, GnRH is 

produced by the hypothalamus and regulates the release of FSH 
and LH, which in turn regulate the development and function of 
the reproductive organs. 

There are eight different forms of GnRH that have 

25 been identified in different vertebrate species (see figure 
1.) For a review of the species distribution amd properties 
of the different forms of GnRH, See Sherwood, N.M. , et al. 
(1993) Endocrine Rev. 14:241-254. For instance, the more 
recently evolved mammals have a form of GnRH designated 

30 mammalisui GnRH. A number of primitive placental mammals, 

nonplacental mammals and other vertebrates have more than one 
form of GnRH. One form of GnRH that is particularly 
widespread in this latter group of animals and has been named 
chicken II GnRH, since it was the second form of GnRH that was 

35 discovered in the chicken. This form of GnRH is also known as 
tHis^,Trp'^,Tyr®] -GnRH since it has amino acid substitutions in 
the 5, 7 and 8 positions as compared to mammalian GnRH (see 
figure 1) . 
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[His5,Trp"',Tyr8] -GnRH is a potent stimulator of 
reproductive function in a wide variety of species. It is 
particularly active in fish. [His^.Trp^.Tyr^] -GnRH is the 
most potent form of GnRH in fish. (See Sherwood, N.M. , et al., 
supra and Ngamvongchon, S., et al. (1992) Begrulatoiy Peptides 
42 : 63 - 73 . ) [His^ , Trp'' , Tyr^] -GnRH also apparently has 
significant biological activity in more recently evolved 
mammals . 

One of the problems in fish aquaculture is control 
of fish reproduction. For some species of fish, it is 
difficult to induce the fish to reproduce in captivity. 
Alternatively, it may be difficult to induce fish in captivity 
to reproduce at the time of year that is normal to the 
species. One approach to the control of reproduction in fish 
for aquaculture is the development of transgenic fish by 
introduction of DNA encoding the tHis5,Trp'',Tyr®] -GnRH 
preprohormone and the [Ser®] -GnRH preprohormone . 

Many of the potential uses of GnRH, for example fish 
aquaculture, require identification and isolation of the gene 
encoding the precursor protein from which the peptide is 
produced. The sequence of the lHis*,TrpVTyr8j -GnRH and the 
[Ser®] -GnRH preprohormones and the sequence of the genes which 
encode these proteins have not been described in the prior 
art. Identification of the genes encoding these sequences 
will facilitate production and use of the decapeptide hormones 
and other biologically active fragments in a variety of 
applications. These and other needs are addressed by the 
present invention. 



SUMMARY OF THE INVENTION 

The present invention provides ccanpositions for 
isolated nucleic acids encoding vertebrate [HisS,Trp'',Tyr®3 - 
GnRH preprohormones and tHis5,Trp"',Tyr8] -GnRH preprohormone 
GAP peptides. The invention also provides for isolated 
nucleic acids encoding vertebrate [Ser^] -GnRH preprohormones 
and ISer*]-6nRH preprohormone GAP peptides. In addition, the 
invention also provides for conpositions con5>rising the above 
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preprohormone polypeptides • The preprohormone polypeptides 
may be, for example, recombinantly produced. 

Nucleic acid probes that selectively hybridize to 
nucleic acids encoding the above preprohormone polypeptides 
are also provided. The invention provides methods of 
detecting nucleic acids encoding the preprohormone 
polypeptides by nucleic hybridization assays utilizing these 
probes. Antibodies that are reactive with the preprohormone 
polypeptides are provided, along with immunoassay methods 
based on these antibodies and which detect the preprohormone 
polypeptides. These nucleic acid hybridization assays and 
immunoassays are useful as measurements of reproductive 
function in a variety of different situations. 

The invention further provides for transgenic 
animals created by the introduction of a DNA construct 
encoding a vertebrate [His^^Trp'^^Tyr®] -GnRH preprohormone or a 
[Ser^] -GnRH preprohormone. These transgenic animals include 
fish species useful in aquaculture and farm animal species, 
such as chickens and other animals. 

DEFINITIONS 

Abbreviations for the twenty naturally occurring 
amino acids follow conventional usage. In the polypeptide 
notation used herein, the left-hand direction is the amino 
terminal direction and the right-hand direction is the 
carboxy- terminal direction, in accordance with standard usage 
and convention. Similarly, unless specified otherwise, the 
left hand end of single- stranded polynucleotide sequences is 
the 5' end; the left hand direction of doiible- stranded 
polynucleotide sequences is referred to as the 5' direction. 
The direction of 5' to 3* addition of nascent RNA transcripts 
is referred to as the transcription direction; sequence 
regions on the DNA strand having the same sequence as the RNA 
and which are 5' to the 5' end of the RNA transcript are 
referred to as "upstream sequences"; sequence regions on the 
DNA strand having the same sequence as the RNA and which are 
3 V to the 3 ' end of the RNA transcript are referred to as 
"downstream sequences 
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The 


nucleotide codes shown 


below are used in the 




nucleic acid 


sequences of the invention. This code has been 




adopted by the lUPAC-IUB Biochemical 


Nomenclature Commission, 


5 


Code 


Group 






A 
C 


A 




10 


G 


C 


cytosine 




T 




guanine 


15 


U 


T 


thymine 




Y 


u 


uracil 




R 


C or T(U) 


pyrimidine 


20 


hi 


A or 6 


purine 




K 




amino 


25 


S 

w 


G or T(U) 

var or U 


keto 

J nyarogen bonds 




H 


A or T(D) 


2 hydrogen bonds 


30 


B , 


A or C or T(U) 


not-G 




V 


G or T(U) or C 


not -A 


35 


D 


G or C or A 


not-T(U) 




N 


G or A or T{U) 


not-C 




G,A,C or T(U) 


any 



40 

The term "nucleic acids", as used herein, refers to 
either DNA or RNA. "Nucleic acid sequence" or "polynucleotide 
sequence" refers to a single- or double- stranded polymer of 
deoxyribonudeotide or ribonucleotide bases read from the 5* 
45 to the 3' end. It includes both self -replicating plasmids, 
infectious polymers of DMA or RNA and nonfunctional DNA or 
RNA. 

The following terms are used to describe the 
sequence relationships between two or more nucleic acids or 
SO polynucleotides: "reference sequence", "con?>arison window", 
"sequence identity", "percentage of sequence identity", and 
"substantial identity". A "reference sequence" is a defined 
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sequence used as a basis for a sequence comparison; a 
reference sequence may be a subset of a larger sequence, for 
example, as a segment of a full-length cDNA or gene sequence 
given in a sequence listing, such as the nucleic acid sequence 
of figure 2, or may comprise a complete cDNA or gene 
sequence. Generally, a reference sequence is at least 20 
nucleotides in length, frequently at least 25 nucleotides in 
length, and often at least 50 nucleotides in length. Since 
two polynucleotides may each (1) comprise a sequence (i.e., a 
portion of the complete polynucleotide sequence) that is 
similar between the two polynucleotides, and (2) may further 
coir5)rise a sequence that is divergent between the two 
polynucleotides, sequence comparisons between two (or more) 
polynucleotides are typically performed by comparing sequences 
of the two polynucleotides over a "con5)arison window" to 
identify and compare local regions of sequence similarity. 

A "comparison window", as used herein, refers to a 
conceptual segment of at least 20 contiguous nucleotide 
positions wherein a polynucleotide sequence may be compared to. 
a reference sequence of at least 20 contiguous nucleotides and 
wherein the portion of the polynucleotide sequence in the 
comparison window may comprise additions or deletions (i.e., 
gaps) of 20 percent or less as compared to the reference 
sequence (which does not comprise additions or deletions) for 
optimal alignment of the two sequences. 

Optimal alignment of sequences for aligning a 
comparison window may be conducted by the local homology 
algorithm of Smith and WatermeOi (1981) Adv. Appl. Math. 2:482, 
by the homology alignment algorithm of Needleman and Wunsch 
(1970) 47. Mol. Biol. 48:443, by the search for similarity 
method of Pearson and Lipman (1988) Proc. Natl. Acad. Sci. 
(USA) 85:24^4, by computerized implementations of these 
algorithms SAP, BESTFIT, FASTA, and TFASTA in the Wisconsin 
Genetics Sorcware Package Release 7.0, Genetics Computer 
Group, 575 Science Dr., Madison, WI) , or by inspection, and 
the best alignment (i.e., resulting in the highest percentage 
of sequence similarity over the comparison window) generated 
by the various methods is selected. Preferred programs for 
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this use include GAP, BESTFIT AND PASTA, with standard 
parameters . 

The term "sequence identity" means that two 
polynucleotide sequences are identical (i.e., on a nudeotide- 
by-nucleotide basis) over the window of conparison. The term 
"percentage of sequence identity" is calculated by conparing 
two optimally aligned sequences over the window of comparison, 
determining the number of positions at which the identical 
nucleic acid base (e.g., A, T, C, G, U, or I) occurs in both 
sequences to yield the nximber of matched positions, dividing 
the number of matched positions by the total number of 
positions in the window of comparison (i.e., the window size), 
and multiplying the result by 100 to yield the percentage of 
sequence identity. 

The terms "substantial identity" or "substantial 
sequence identity" as applied to nucleic acid sequences and as 
used herein and denote a characteristic of a polynucleotide 
sequence, wherein the polynucleotide comprises a sequence that 
has at least 85 percent sequence identity, preferably at least 
90 to 95 percent sequence identity, more usually at least 99 
percent sequence identity as conpared to a reference sequence 
over a comparison window of at leaist 20 nucleotide positions, 
frequently over a window of at least 25-50 nucleotides, 
wherein the percentage of sequence identity is calculated by 
comparing the reference sequence to the polynucleotide 
sequence which may include deletions or additions which total 
20 percent or less of the reference sequence over the window 
of coirparison. The reference sequence may be a subset of a 
larger sequence, for exair^jle, as a segment of the full-length 
lHis5,Trp'',Tyr^]-GnRH and [Ser^] -GnRH sequences disclosed 
herein. 

As applied to polypeptides, the terms "sxxbstantial 
identity" or "substantial sequence identity" mean that two 
peptide sequences, when optimally aligned, such as by the 
programs GAP or BESTFIT using default gap weights, share at 
least 80 percent sequence identity, preferably at least 90 
percent sequence identity, more preferably at least 95 percent 
sequence identity or more (e.g., 99 percent sequence 
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identity) . Preferably, residue positions which are not 
identical differ by conservative amino acid substitutions. 
Conservative amino acid substitutions refer to the 
interchangeability of residues having similar side chains. 
For example, a group of amino acids having aliphatic side 
chains is glycine, alanine, valine, leucine, and isoleucine; a 
group of ami-no acids having aliphatic-hydroxyl side chains is 
serine and threonine; a group of amino acids having amide - 
containing side chains is asparagine and glutamine; a group of 
amino acids having aromatic side chains is phenylalanine, 
tyrosine, and tryptophan; a group of amino acids having basic 
side chains is lysine, arginine, and histidine; and a group of 
amino acids having sulfur- containing side chains is cysteine 
and methionine. Preferred conservative amino acids 
substitution groups are: valine- leucine- isoleucine, 
phenylalanine -tyrosine, lysine -arginine, alanine -valine, and 
asparagine - glutamine . 

The term "isolated" or "substantially pure" means a 
conpound is the predominant species present (i.e., on a molar 
basis it is more abxindant than any other individual species in 
the composition) , and preferably a substantially purif iec' 
fraction is a composition wherein the object species comp.ises 
at least about 50 percent (on a molar basis) of all 
macromoleculeu: species present. Generally, a substantially 
pure coDsposition will coitprise more than about 80 to 90 
percent of all macromolecular species present in the 
con?)osition. Mtast preferably, the species is purified to 
essential homogeneity (contaminant species cannot be detected 
in the conposition by conventional detection methods) wherein 
the composition consists essentially of a single 
macromolecular species. For exaitple, "isolated" or 
"substantially pure", when referring to nucleic acids, refers 
to those that have been purified away from other chromosomal 
or extrachromosomal DNA or RNA by standard techniques, 
including alkaline/SDS treatment, CsCl banding, colximn 
chromatography, and other techniques well known in the art. 
See, F. Ausxibel, et al., ed. Current Protocols in Molecular 
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Biology, Greene Publishing and Wiley- Interscience, New York 
(1987), incorporated herein by reference, 

"Nucleic acid probes" may be DNA or RNA fragments. 
DNA fragments prepared, for example, by digesting plasmid DNA, 
5 or by use of PCR, or synthesized by either the phosphoramidite 
method described by Beaucage and Carruthers, retraiiedron Lett. 
22:1859-1862 (1981), or by the triester method according to 
Matteucci, etal., J. Am. Cham. Soc, , 103:3185 (1981), both 
incorporated herein by reference. A double stranded fragment 

10 may then be obtained, if desired, by annealing the chemically 
synthesized single strands together under appropriate 
conditions or by synthesizing the complementary strand using 
DNA polymerase with an appropriate primer sequence. Where a 
specific sequence for a nucleic acid probe is given, it is 

15 understood that the complementary strand is also identified 

and included. The complementary strand will work equally well 
in situations where the target is a double -streuided nucleic 
acid. 

The phrase "selectively hybridizing to" refers to a 

20 nucleic acid probe that hybridizes, duplexes or binds only to 
a paLrticular target DMA or RNA sequence wheii the target 
sequences are present in a preparation of total cellular DNA 
or RNA. "Complementary" or "target" nucleic acid sequences 
refer to those nucleic acid sequences which selectively 

25 hybridize to a nucleic acid probe. Proper annealing 

conditions depend, for example, upon a probe's length, base 
composition, aind the number of mismatches and their position 
on the probe, and must often be determined empirically. For 
discussions of nucleic acid probe design and annealing 

30 conditions, see, for example, Sambrook et al., Moleculair 

Cloning: A Laboratory Manual (2nd ed.). Vols. 1-3, Cold Spring 
Harbor Laboratory, (1989) or Current Protocols in Molecular 
Biology, F. Ausubel et al., ed. Greene Publishing and Wiley - 
Interscience, New York (1987) . 

35 The phrase "nucleic acid sequence encoding" refers 

to a nucleic acid which directs the e:^ression of a specific 
protein or peptide. The nucleic acid sequences include both 
the DNA stremd sequence that is transcribed into RNA and the 
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RNA sequence that is translated into protein. The nucleic 
acid sequences include both the full length nucleic acid 
sequences as well as non-full length sequences derived from 
the full length protein. It being further understood that the 
S sequence includes the degenerate codons of the native sequence 
or sequences which may be introduced to provide codon 
preference in a specific host cell. 

The phrase "expression cassette", refers to 
nucleotide sequences which are capable of affecting expression 

10 of a structural gene in hosts compatible with such sequences. 
Such cassettes include at least promoters and optionally, 
transcription termination signals. Additional factors 
necessary or helpful in effecting expression may also be used 
as described herein. 

15 The term "operably linked" as used herein refers to 

linkage of a promoter upstream from a DNA sequence such that 
the' promoter mediates transcription of the DNA sequence. 

The term "vector", refers to viral expression 
systems, autonomous self -replicating circular DNA (plasmids) , 

20 and includes both expression and nonexpression plasmids. 

Where a recombinant microorganism or cell culture is described 
as hosting am "eacpression vector, " this includes both 
extrachromosomal circular DNA and DNA that has been 
incorporated into the host chromosome (s) • Where a vector is 

25 being maintained by a host cell, the vector may either be 

stcLbly replicated by the cells during mitosis as an autonomous 
structure, or is incorporated within the host's genome. 

The term "plasmid" refers to an autonomous circular 
DNA molecule capable of replication in a cell, and includes 

30 both the expression and nonexpression types. Where a 

recombinant microorganism or cell cult; re is described as 
hosting an "expression plasmid", this includes both 
extrachromosomal circular DNA molecules and DNA that has been 
incorporated into the host chromosome (s) . Where a plasmid is 

35 being maintained by a host cell, the plasmid is either being 

stcQDly replicated by the cells during mitosis e . an autonomous 
stmicture or is incorporated within the host's genome. 
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The phrase "recombinant protein" or "recombinantly 
produced protein" refers to a peptide or protein produced 
using non-native cells that do not have an endogenous copy of 
DNA able to express the protein. The cells produce the 
5 protein because they have been genetically altered by the 

introduction of the appropriate nucleic acid sequence. The 
recombinant protein will not be found in association with 
proteins and other subcellular components normally associated 
with the cells producing the protein. 

10 "Specifically immunoreactive" refers to a binding 

reaction between an antibody and antigen which is 
determinative of the presence of the antigen in the presence 
of a heterogeneous population of proteins and other biological 
macromolecules or in a biological sample. Thus, under 

15 designated immunoassay conditions, the specified antibodies 

bind to a particular protein and do not bind in a significant 
amount to other proteins present in the ssunple* Specific 
binding to an antibody under such conditions may require an 
antibody that is selected for its specificity for a particular 

20 protein. For example, antibodies raised to the H. burtoxii 
[H±s^, Trp'' , Tyr®] -GnRH prep^ with the amino acid 

sequence depicted in Seq. ID No, 2 or to the treeshrew 
lHis^,Trp'^,Tyr®] -GnRH preprohormone sequence depicted in Seq. 
ID No. 18 can be selected to obtain antibodies specifically 

25 immunoreactive with [His^,Trp'^,Tyr^] -GnRH preprohormone 

proteins and not to other proteins. Homologous proteins to 
the H. burtoni [His^,Trp'^,Tyr®] -GnRH preprohormone and the 
treeshrew tHis^,Trp'^,Tyr^] -GnRH preprohormone encompass 
[His^,Trp'',Tyr®] -GnRH preprohormone species, but do not 

30 include other proteins. Similarly, antibodies raised to the 
H. Jburtoni ISer®] -GnRH preprohormone with the amino acid 
sequence depicted in Seq. ID No. 20 can be selected to obtain 
antibodies specifically immunoreactive with [Ser®] -GnRH 
preprohormone proteins and not to other proteins. Homologous 

35 proteins to the H. biirtoxx± [Ser®] -GnRH preprohormone encompass 
[Ser®]-GnRH preprohormone species, but do not include other 
proteins . 
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A variety of imntunoassay formats may be used to 
select antibodies specifically immunoreactive with a 
particular protein. For example, solid-phase ELISA 
immunoassays are routinely used to select monoclonal 
5 antibodies specifically immunoreactive with a protein. See 
Harlow and Lane (1988) Antibodies, A Laboratory Manual, Cold 
Spring Harbor Publications, New York, for a description of 
immunoassay formats and conditions that can be used to 
determine specific immunoreactivity. 
10 "Biological sample" as used herein refers to any 

sample obtained from a living organism or from an organis::; 
that has died. Examples of biological samples include body 
fluids and ti/'sue specimens. 



15 BRIEF DESCRIPTION OF THE DRAWINGS 

Fig, lA is a comparison of known GnRH amino acid 
sequences* The primary structures of the eight known GnRH 
forms cuid of the first nine amino -terminal residues from yeast 
of-mating factor are shown. 

20 Pig. IB shows the predicted amino acid sequence of 

the preprohormone for H. burtoni tTrp"^, Leu®] -GnRH as compared 
with the [HisS,Trp*',Tyr®] -GnRH peptide. The two residues at 
which these forms differ are indicated in bold print. 
Degenerate 5' oligonucleotides used for PGR are shown below. 

25 Fig. 2 shows the cDNA and predicted amino acid 

sequence of H. burtoni prepro[His^,Trp'',Tyr®] -GnRH. The 
functional domains of prepro [His^,Trp"',Tyr®] -GnRH are 
illustrated and compared to H. Jburtoni prepro [Trp'', Leu®] -GnRH. 
Amino acids are niambered beginning at the first residue in the 

30 GnRH decapeptide* The fxxnctional domains of both 

preprohormones are illustrated. Hydrophobic signal sequences 
(negative numbers) are directly followed by the GnRH 
decapeptide region (bold) and residues involved with 
posttremslational processing. The GAP peptide sequences 

35 follow the decapeptide and posttranslational processing 

sequences. A second processing site (underlined) indicates 
coding sequences for two novel peptides in the 
t His^ , Trp'^ , Tyr® ] - GnRH preprohormone . 
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Fig. 3 shows the results of high stringency Northern 
blot analysis using total RNA from ventral, dorsal or whole 
brain, hybridized to ^^p. labelled prepro [His^,Trp'',Tyr^] -GnRH 
riboprobe. Lanes 1, 2, and 3 were loaded with 5, 10 and 20 
Hg, respectively, of total RNA extracted from a pool of 3 
brains. Lanes 4 through 7 contain 10 /xg of total RNA each 
from one half of a single brain. Lanes 4 and 6 contain 
ventral brain RNA while lanes 5 and 7 contain RNA from the 
corresponding dorsal halves. One band of approximately 530 
bases is seen in lanes containing ventral or total brain RNA. 
No signal was detected in total RNA extracted from dorsal 
brain halves, localizing the expression of this transcript 
exclusively to more ventral brain regions. Methylene blue 
staining of ribosomal bands confirmed that the equal amounts 
15 of total RNA were loaded into lanes 4-7 of the gel. 

Fig. 4 shows localization of prepro 
[His5,Trp',Tyr®] -GnRH roRNA in H. burton! brain. TOP: 
schematic view of a mid-sagittal section from H. Jburtoai brain 
showing the three populations of neurons immunoreactive for 
GnRH. For simplicity, the clusters are represented within the 
same plgme although the ihes encephalic "population lies lateral 
to both the terminal nerve and POA populations. The terminal 
nerve GnRH population is indicated by is indicated by a, the 
POA population by and the mesencephalic population by t. 
25 The bottOTi six panels show the labelling with each of the two 
GnRH gene probes in the three brain regions with cells known 
to contain GnRH from immunocytochemistry. LEFT COLUMN: in 
situ hybridization with a prepro [His^,Trp'^,Tyr®] -GnRH 
digoxygenin-OTP riboprobe. Only the mesencephalic population 
30 ufas labeled. Scale bar » 100 fm. RIGHT COLUMN: in situ 

hybridization with a prepro ITrp"', Leu®] -GnRH digoxygenin-OTP 
riboprobe. Only the terminal nerve population was ladaeled. 
Scale bar » 100 fim. Note that neither probe labels the 
POA/hypothalamic group of cells. 
35 Pig. 5 shows localization of prepro [Ser®] -GnRH mRNA 

in H. jburtoni brain. Midsagittal sections are shown in the 
panels. The top two panels show hybridization with a prepro 
[Ser®] -GnRH riboprobe in subordinate and daninant male fish. 



20 
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The bottom two panels show hybridization of the riboprobe in 
the terminal nerve area and the mesencephalon regions of the 
H. burtoni brain of dominant males. 



5 DESCRIPTION OF THE PREFERRED EMBODIMENTS 

This invention provides isolated nucleic acid 
sequences encoding precursor proteins for the chicken II form 
of GnRH, which is referred to here as the [His^,Trp'',Tyr®] - 
GnRH preprohormone, and for the [Ser®] -GnRH preprohormone . 

10 The seqpiences can be used in a number /of applications. For 
instance, they can be used for the recombinant production of 
the preprohormone polypeptides. In addition, transgenic 
animals {e.g., fish and chickens, and non-human mammals) caai 
be generated using the sequences provided here. Diagnostic 

15 assays for detecting nucleic acids encoding the 

tHis^,Trp'^,Tyr®] -GnRH preprohormone or the [Ser®] -GnRH 
preprohormone, as well as assays for these preprohormone 
proteins are also provided. These assays are particularly 
useful in the diagnosis of reproductive diseases or 

20 reproductive capacity in animals. Compositions and methods 
for using the sequences provided here are described in more 
detail below. 



A} — GnRH preprohormone polvoeptidea 

25 As described above, there are eight different forms 

of GnRH that have been isolated and characterized from 
vertebrate species (see figure 1) . A precursor for an 
additional form of GnRH, [Ser®] -GnRH, is also described 
herein. Figure 1 shows all decapeptide sequences except 

30 tSer®] -GnRH with reference to the ma-imalian GnRH sequence. 

tSer®] -GnRH is a newly discovered fom of GnRH, so that there 
are now 9 known forms of GnRH. Residues which differ from 
those in the mammalian form are identified both by position 
and residue. Thus, teleost GnRH is referred to as 

35 iTrp*^, Leu®] -GnRH because, in teleosts, the Leu in position 7 

is Trp and the Arg in position 8 is Leu when compared with the 
mammalian GnRH sequence. As can be seen in GnRH, positions 1, 
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2, 4, 9, and 10 are invariant, and positions 3 and 7 show only 
conservative changes. 

The predicted amino acid sequence for the 
[His^^Trp'^^Tyr®] -GnRH preprohormone from the teleost fish, H. 
5 jburtoni, is shown in Figure 2 (Seq. ID No. 2). In addition, 
the predicted amino acid sequence for the treeshrew 
[His^,Trp'^,Tyr®] -GnRH preprohormone is shown in Seq ID No. 18. 
The precursor of a additional form of GnRH, [Ser®] -GnRH, is 
shown in Seq. ID No. 20. The GnRH preprohormones have a 

10 Characteristic structure consisting of three separate domains. 
The N- terminal domain consists of a signal peptide region 
which is a common feature of many secretory proteins. The 
second domain consists of the [His^,Trp'^,Tyr®] -GnRH 
decapeptide followed immediately by a 3 amino acid amidation 

15 and precursor processing site. The third, C- terminal, domain 
consists of a GnRH associated peptide or GAP peptide. The GAP 
peptide region of the lHis^,Trp'^,Tyr®] -GnRH preprohormone may 
generate two peptides because of an additional potential 
proteolytic processing site in this region of the 

20 preprohormone. Thus, a single [His^,Trp'^,Tyr®] -GnRH 

preprbhomdne GAP peptide consisting of the entire sequence of 
this region of the protein may be produced. Alternatively, 
there may be two [His^,Trp*^,Tyr®l -GnRH preprohormone GAP 
peptides present. 

25 The [His^,Trp'^,Tyr®] -GnRH preprohormone and the 

[Ser®] -GnRH preprohormone represent two members of a family of 
GnRH preprohormones. Homology between the H. burtoni and the 
treeshrew lHis^,Trp'^,Tyr®] -GnRH preprohormones and the H. 
Jburtoni [Ser®] -GnRH preprohormone was determined by the PASTA 

30 and GAP computer programs (version 7.3 Unix, Genetics Computer 
Group, 575 Science Drive, Madison, Wisconsin, USA) . Using 
this procedure there is a 59% homology, and a 44% amino acid 
identity between the amino acid sequences of H. Jburtoni and 
treeshrew [His^,Trp'',Tyr®] -GnRH preprohormones. Also using 

35 this procedure, there is a 55% homology and a 33% identity 
between the amino acid sequences of the H. Jburtoni 
tHis5,Trp*^,Tyr®] -GnRH preprohormone and the JI. Jburtoni 
[Ser®] -GnRH preprohormone. 
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The terms " [His^^Trp'^, Tyr®] -GnRH preprohormone " , 
" [His^,Trp'^,Tyr®] -GnRH preprohormone polypeptides" or "chicken 
II GnRH preprohormone" as used herein refer to all polypeptide 
precursor forms of [His^,Trp'',Tyr®] -GnRH, but exclude the 
5 mature [His^,Trp*',Tyr®] -GnRH decapeptide. Similarly, the 
terms " [Ser®] -GnRH preprohormone" or " tSer®] -GnRH 
preprohormone polypeptides" as used herein refer to all 
polypeptide precursor forms of [Ser®] -GnRH, but exclude the 
mature [Ser®] -GnRH decapeptide. These terms also refer to 

10 biologically active fragments of the precursor proteins for 
[His^,Trp*^,Tyr®] -GnRH and [Ser®] -GnRH respectively. 
Significant biological activities include GnRH activity, 
latent GnRH activity and immunological activity. Latent GnRH 
activity means that the biologically active fragment can be 

15 proteolytically processed to yield active GnRH. 

Immunological activity refers to immunoreactivity 
with an antibody raised to a full-length [His^,Trp'^,Tyr®] -GnRH 
or [Ser®] -GnRH preprohormone or to segments of these 
preprohormones . A segment of a [His^^Trp'^^Tyr®] -GnRH 

20 preprohormone or [Ser®] -GnRH preprohormone will ordinarily 

comprise at least about 5 contiguous amino acids, typically at 
least about 7 contiguous amino acids, more typically at least 
about 9 contiguous amino acids, usually at least about 11 
contiguous amino acids, prefered^ly at least about 13 

25 contiguous amino acids, more preferably at least about 16 

contiguous amino acids, and most preferably at least about 20 
to 30 or more contiguous amino acids from the preprohormone. 
Segments of a particular domain will be segments of the 
appropriate size within the corresponding domain. 

30 Biologically active fragments include the [His^,Trp'',Tyr®] - 
GnRH preprohormone and [Ser®] -GnRH preprohormone GAP peptide 
or peptides. 

The terms "GnRH related peptide", "GAP" or "GAP 
peptide" refer to the peptide of the C- terminal domain of a 
35 GnRH preprohormone, as described above for the 

[His^,Trp'^,Tyr®] -GnRH preprohormone and the [Ser®] -GnRH 
preprohormone. Previously known GAP peptides are further 
described in Sherwood, N.A. , et al., supra. The terms 
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" [His^,Trp*^,Tyr®l -GnRH preprohormone GAP peptide" or "chicken 
II GnRH preprohormone GAP peptide*' refer to the GAP peptide or 
peptides derived from the [His^,Trp'^,Tyr®] -GnRH preprohormone. 
The term " [Ser^] -GnRH preprohormone GAP peptide" refers to the 
5 GAP peptide or peptides derived from the [Ser®3 -GnRH 

preprohormone. The [His^,Trp'^,Tyr®] -GnRH preprohormone and 
[Ser®] -GnRH preprohormone GAP peptides are produced by the 
proteolytic processing of [His^^Trp*^, Tyr®3 -GnRH preprohormone 
and [Ser®] -GnRH preprohormone, respectively. The 

10 [His^,Trp'^,Tyr®] -GnRH preprohormone GAP peptide is a 

biologically active [His^,Trp'^,Tyr®] -GnRH preprohormone 
fragment, as defined above, and is therefore included in the 
definition of [His^,Trp'^,Tyr®] -GnRH preprohormone 
polypeptides. Similarly, the [Ser®] -GnRH preprohormone GAP 

15 peptide is a biologically active [Ser®] -GnRH preprohormone 

fragment, as defined above, and is therefore included in the 
definition of [Ser®] -GnRH preprohormone polypeptides. 

The above -defined terms refer not only to the 
protein having the amino acid sequence disclosed here, but 

20 also to other proteins that are allelic, nonallelic or species 
variants of the H. burtoixi auid tfeeshrew^^^^I^^ 
preprohormones and the H. Burtozxi [Ser®] -GnRH preprohormone, 
as well as to natural or induced mutant forms of these 
proteins. For example, lHis^,Trp'^,Tyr®] -GnRH preprohormone 

25 polypeptides generally show substantial sequence identity 

(determined as described above) to the amino acid sequence of 
the H. burtojxL lHis^,Trp'^,Tyr®] -GnRH preprohormone shown in 
figure 2 or to the amino acid sequence of the treeshrew 
[His ^,Trp'^,Tyr®] -GnRH preprohormone shown in Seq, ID No. 18. 

3 0 For example , fish [His^ , Trp*^ , Tyr® ] - GnRH preprohormone 

polypeptides generally show substantial secjuence identity to 
the amino acid sequence of the H. burtoni tHis^,Trp'^,Tyr®] - 
GnRH preprohormone shown in figure 2, and mammalian 
IHis^,Trp'',Tyr®] -GnRH preprohormone polypeptides generally 

35 show substantial sequence Identity (determined as described 
above) to the amino acid sequence of the treeshrew 
[His^,Trp'',Tyr®] -GnRH preprohormone shown In Seq. ID No. 18. 
Similarly, [Ser®] -GnRH preprohormone polypeptides generally 
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Show substanti."?! sequence identity to the amino acid sequence 
of the H. burt jai [Ser^] -GnRH preprohormone shown in Seq ID 
No. 20. 

[His5,Trp"',Tyr^] -GnRH preprohormone polypeptides 
will also typically be specifically irr loreactive with 
antibodies raised against the H. burtc. tHis5,Trp'',Tyr®] -GnRH 
preprohormone of figure 2 or against the treeshrew 
rHis5,Trp'',Tyr®] -GnRH preprohormone shown in Seq. ID No. 
t???] . In particular, fish [HisS,Trp',Tyr8] -GnRH 
preprohormone polypeptides will typically be specifically 
immunoreactive with antibodies raised against the H. burtoni 
tHis5,Trp^,Tyr®] -GnRH preprohormone of figure 2, and mammalian 
tHis^,Trp'',Tyr^] -GnRH preprohormone polypeptides will 
typically be specifically immunoreactive with antibodies 
raised against the treeshrew [His^,Trp'',Tyr®] -GnRH 
preprohormone shown in Seq. ID No. 18. Similarly, [Ser^] - 
GnRH preprohormone polypeptides will generally be specifically 
imottinoreactive with antibodies raised against the H. burtoni 
tSer^] -GnRH preprohormone shown in Seq. ID No. 20. 

A [His5,Trp'',Tyrfi] -GnRH preprohormone that 
specifically binds to or that is specifically immunoreactive 
to an antibody generated against a defined immunogen, such as 
an immunogen consisting of the amino acid sequence of Seq. ID 
No. 2 or Seq. ID No. 18 is determined in an immunoassay. The '_. 
irsnunoassay uses a polyclonal antiserum which was raised to 
the protein of Seq. ID No. 2 or Seq. ID No. 18. This 
antiserum is selected to have low crossreactivity against GnRH 
preprohozmones other than tHis5,Trp'',Tyr®] -GnRH 
preprohormones. Crossreactivity to these other forms of GnRH 
preprohormone is removed by immur^oabsorption prior to use in 
the immunoassay. 

In order to produce euitisera to tHis^,Trp'',Tyr®] - 
GnRH precursor for use in an immunoassay, the proteins of Sec 
ID No. 2 and Seq. ID No. 18 are isolated as describ d hereii. 
For example, recombinant protein is produced in a msimnaliar 
cell line. An inbred strain of mice such as balb/c or rab: 
are immunized with the protein of Seq. id No. 2 or Seq. li 
No. 18 using a standard adjuvant, such as Preund's adjuvant. 
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and a standard mouse immunization protocol (see Harlow and 
Lane, supra) . Polyclonal sera are collected and titered 
against the iramunogen protein in an immunoassay, for example, 
a solid phase immunoassay with the immunogen immobilized on a 
5 solid support. Polyclonal antisera with a titer of 10* or 
^greater are selected and tested for their cross reactivity 
against GnRH preprohormones other than [His^,Trp'^,Tyr®] -GnRH 
preprohormone , using a competitive binding immunoassay such as 
the one described in Harlow and Lane, supra, at pages 570-573. 

10 Three non- tHis^,Trp'^,Tyr®] -GnRH preprohormones are used in 

this determination: the mammalian GnRH preprohormone obtained 
from human, rat, and mouse (see Sherwood, et al. (1993), 
supra); the [Trp*^, Leu®] -GnRH preprohormone obtained from 
Salmon and H. Jburtoni, and the [Ser®] -GnRH preprohormone 

15 obtained from H. burtoni. These non- [His^,Trp'^,Tyr®] -GnRH 
preprohormones can be produced as recombinant proteins and 
isolated using standard molecular biology and protein 
chemistry techniques as described herein. 

Immunoassays in the coxi^etitive binding format can 

20 be used for the crossreactivity determinations. For exanple, 
the protein of Seq. ID No. 2 or Seq. ID No. 18 cain be 
immobilized to a solid support. Proteins added to the assay 
con^ete with the binding of the ajitisera to the immobilized 
antigen. The ability of the above proteins to compete with 

25 the binding of the auitisera to the immobilized protein is 

compared to the protein of Seq. ID No. 2 or Seq. ID No. 18. 
The percent crossreactivity for the above proteins is 
calculated, using standard calculations. Those antisera with 
less than 10% crossreactivity with each of the proteins listed 

30 above are selected and pooled. Antisera raised against the 

protein of Seq ID No. 2 and the protein of Seq. ID No. IB are 
pooled separately. The cross -reacting amtibodies are then 
removed from the pooled antisera by Immunoabsorption with the 
above- listed proteins. 

35 The Immunoabsorbed and pooled antisera are then used 

in a coxnpetitive binding immunoassay as described above to 
conspare a second protein to the immunogen protein. In order 
to msLke this coxt^arison, the two proteins are each assayed at 
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a wide range of concentrations and the amount of each protein 
required to inhibit 50% of the binding of the antisera to the 
immobilized protein is determined. For example, if the amount 
of the second protein required is less than 10 times the 
5 amount of the protein of Seq. ID No. 2 required, then the 
second protein is said to specifically bind to an antibody 
generated to an immunogen consisting of the protein of Seq. ID 
No. 2. 

A iSer®] -GnRH preprohormone that specifically binds 
10 to or that is specifically iramuno reactive to an antibody 

generated against a defined immunogen, such as an immunogen 
consisting of the amino acid sequence of Seq. ID No. 20 can 
also determined in an immunoassay, by using a procedure 
similar to that described above for [His^,Trp'',Tyr®] -GnRH 
15 preprohorxnones . In this case, however, polyclonal antisera 
are selected and tested for their cross reactivity against 
GnRH preprohormone s other than [Ser®] -GnRH preprohormone. The 
non- [Ser®] -GnRH preprohormones used in this determination are: 
the mammalian GnRH preprohormone obtained from hxjman, ro", and 
20 mouse (see Sherwood, at al. (1993), supra); the [Trp'^,Lea^] - 
GnRH preprohormone obtained from Salmon and H. burton! , and 
the iHis^jTrp'^fTyr®] -GnRH preprohormone obtained from H. 
Jburtoni and the treeshrew. 

25 B. Nucleic acid compositions for THis^.Trp'^.Tyr^l -GnRH sind 
rser^l -GnRH preprohormone polypeptides 

This invention relates to isolated nucleic acid 
sequences encoding lHis^,Trp'',Tyr®] -GnRH preprohormone and 
[Ser^] -GnRH preprohorxnone polypeptides. The nucleic acid 

30 compositioxis of this invention, whether RNA, cDNA, genomic 

DNA, or a hybrid of the vairious concibinations, may be isolated 
from natural sources or may be synthesized in vitro. The 
nucleic acids claimed may be present in transformed or 
transfected whole cells, in a transformed or transfected cell 

35 lysate, or in a partially purified or substantially pure form. 

The nucleic acid sequences of the invention are 
typically identical to or show substantial sequence identity 
(determined as described above) to the nucleic acid sequence 
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of SEQ ID. No. 1, Seq. ID No. 17, or Seq. ID No. 19. For 
example, nucleic acids encoding [His^,Trp'^,Tyr®] -GnRH 
preprohormone polypeptides and which show substantial secjuence 
identity to Seq. ID No. 1 will typically hybridize to the 
nucleic acid sequence of Seq. ID No. 1 under stringent 
conditions. Similarly, nucleic acids encoding 
[His^,Trp'',Tyr®] -GnRH preprohormone polypeptides and which 
show substantial sequence identity to Seq. ID No. 17 will 
typically hybridize to the nucleic acid sequence of Seq. ID 
No. 17 vmder stringent conditions. With regard to [Ser^] -GnRH 
preprohormones , nucleic acids encoding [Ser®] -GnRH 
preprohormone polypeptides and which show substantial sequence 
identity to Seq. ID No. 19 will typically hybridize to the 
nucleic acid sequence of Seq. ID No. 19 under stringent 
conditions. Stringent conditions are sequence dependent and 
will be different in different circumstances. Generally, 
stringent conditions are selected to be about 5*> C lower than 
the thermal melting point (Tm) for the specific sequence at a 
defined ionic strength and pH. The Tm is the teirqaerature 
(under defined ionic strength and pH) at which 50% of the 
target sequence hybridizes to a perfectly matched probe. 
Typically, stringent conditions will be those in which the 
salt concentration is at least about 0*02 molar at pH 7 and 
the ten5>erature is at least about 60«^C. As other factors may 
significantly affect the stringency of hybridization, 
including, among others, base coirposition and size of the 
complementary strands, the presence of organic solvents and 
the extent of base mismatching, the combination of parameters 
is more in^jortant than the absolute measure of any one. 

Techniques for nucleic acid manipulation of genes 
encoding these polypeptides such as subcloning nucleic acid 
sequences encoding polypeptides into expression vectors, 
labelling probes, DNA hybridization, and the like are 
described generally in Sambrook, et al.. Molecular Cloning - A 
Laboratory Manual (2nd Ed.), Vol. 1-3, Cold Spring Harbor 
Laboratory, Cold Spring Harbor, New York, 1989, which is 
incorporated herein by reference. This manual is hereinafter 
referred to as "Sambrook, et al." 
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There are various methods of isolating the DNA 
sequences encoding [His^iTrp'^^Tyr®] -GnRH preprohormone and 
[Ser®] -GnRH preprohormone polypeptides. For exaitrple, the DNA 
is isolated from a genomic or cDNA library using labelled 
5 oligonucleotide probes having sequences complementary to the 
sequences disclosed here. Restriction endonuclease digestion 
of genomic DNA or cDNA containing the [His^,Trp'',Tyr®] -GnRH 
preprohormone gene or the [Ser®] -GnRH preprohormone gene can 
be used to isolate nucleic acids encoding these proteins. 

10 Since the DNA sequences encoding [His^.Trp'^^Tyr®] -GnRH and 
tSer®] -GnRH preprohormones are provided here, a panel of 
restriction endonucleases can be constructed to give cleavage 
of the DNA in the desired regions • After restriction 
endonuclease digestion, DNA encoding [His^,Trp'^,Tyr®] -GnRH and 

15 [Ser®] -GnRH preprohormones is identified by its ability to 
hybridize with nucleic acid probes, for example on Southern 
blots, and these DNA regions are isolated by standard methods 
familiar to those of skill in the art. See Sambrook, et al. 

Various methods of amplifying target sequences, such 

20 as the polymerase chain reaction can also be used to prepare 
tHis5,Trp'^,Tyr®]-GnRH or [Ser®] -GnRH preprohormone DNA. 
Polymerase chain reaction technology (PGR) is used to amplify 
nucleic acid sequences of the IHis^,Trp'^,Tyr®] -GnRH or [Ser®] - 
GnRH preprohormone polypeptides directly from mRNA, from cDNA, 

25 and from genomic libraries or cDNA libraries. The isolated 
sequences encoding tHis^,Trp'',Tyr®l -GnRH preprohormone or 
[Ser®] -GnRH preprohormone may also be used as templates for 
PGR amplification. 

Appropriate primers and probes for ainplifying 

30 nucleic acids encoding nucleic acids encoding the 

[His^,Trp'^,Tyr®]-GnRH or the [Ser®] -GnRH preprohormone 
polypeptides can be generated from analysis of the DNA 
sequences. In brief, oligonucleotide primers complementary to 
the two 3' borders of the DNA region to be amplified are 

35 synthesized. The polymerase chain reaction is then carried 
out using the two primers. See PCR Protocols: A Guide to 
Methods and Applications. (Innis, M, Gelfand, D., Sninsky, J. 
and White ^ T. , eds.). Academic Press, San Diego (1990). 
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Primers can be selected to amplify the entire regions encoding 
the full-length [His^,Trp'^,Tyr®] -GnRH or [Ser®] -GnRH 
preprohormones or to amplify smaller DNA segments of these 
preprohormones as desired. 
5 Oligonucleotides for use as probes are chemically 

synthesized according to the solid phase phosphoramidite 
triester method first described by Beaucage, S.L. and 
Carruthers, M.H, , 1981, Tetrahedron ietts. , 22 (20) : 1859-1862 
using an automated synthesizer, as described in 

10 Needham-VanDevanter, D,R., et al., 1984, Nucleic Acids Res., 
12:6159-6168. Purification of oligonucleotide is by either 
native acrylamide gel electrophoresis or by anion- exchange 
HPLC as described in Pearson, J.D. and Regnier, F.E., 1983, J. 
Chrom., 255:137-149. 

15 The sequence of the synthetic oligonucleotide can be 

verified using the chemical degradation method of Maxam, A.M. 
and Gilbert, 1980, in W. , Grossman, L. cuid Moldave^ D., eds. 
Academic Press, New York, Methods in Enzymology, 65:499-560. 

Other methods known to those of skill in the art may. 

20 also be used to produce and isolate nucleic acids encoding 
[His^VTrp'^/Tyr®] -GnRH pre [Ser®] -GhBH 

preprohormone . See Sambrook, et al. for a description of 
other techniques for the isolation of DNA encoding specific 
protein molecules. 

25 

— Expressio n of rHis^.Tro'^.Tvr^l >GnRH nreprohormone and 
rSer^l-QnRH prepro hormone. 

Once the DNA encoding [His^,Trp'^,Tyr®] -GnRH 
preprohormone or the [Ser®] -GnRH preprohormone is isolated and 
30 cloned, one may express these preprohormones in a variety of 
recombinantly engineered cells • It is e3q)ected that those of 
skill in the art are knowledgeable in the nxjmerous expression 
systems available for expression of the DNA encoding these 
preprohormones. No attenpt to describe in detail the various 
35 methods known for the expression of proteins in prokaryotes or 
eukaryotes is made here. 

In brief summary, the expression of natural or 
synthetic nucleic acids encoding IHis^,Trp'','iyr®] -GiiRH or 
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iSer®] -GnRH preprohormone will typically be achieved by 
operadDly linking the DNA or cDNA to a promoter (which is 
either constitutive or inducible) , followed by incorporation 
into an eaqjression vector • The vectors can be suitcQ)le for 
5 replication and integration in either prokaryotes or 

eukaryotes. Typical expression vectors contain transcription 
and translation terminators, initiation. sequences, and 
promoters useful for regulation of the expression of the 
polynucleotide sequence encoding [His* rrp'^,Tyr®] -GnRH or 

10 [Ser®] -GnRH preprohormone polypeptides. To obtain high level 
expression of a cloned gene, such as those polynucleotide 
sequences encoding [His^,Trp'^,Tyr®] -GnRH or [Ser®3 -GnRH 
preprohormone, it is desirable to construct expression 
plasmids which contain, at the minimum, a strong promoter to 

15 direct transcription, a ribosome binding site for 

translational initiation, and a transcription/translation 
terminator. The expression vectors may also comprise generic 
expression cassettes containing at least one independent 
terminator sequence, sequences permitting replication of the 

20 plasmid in both eukaryotes and prokaryotes. I.e., shuttle 
vectors, and selection markers for both prokaryotic and 
eukaryotic systems. See Sambrook et al. Examples of 
expression of [His5,Trp'^,Tyr®] -GnRH and [Ser^l -GnRH 
preprohormone polypeptides in both prokaryotic and eukaryotic 

25 systems are described below. 

1. Expression in Prokaryotes 

A variety of procaryotic e3q)ression systems may be 
used to express [His^,Trp'',Tyr®3 -GnRH preprohormone or [Ser®] • 

30 GnRH preprohormone polypeptides. Exaxrples include E. coli, 
Bacillus, Streptomyces, and the like. For exasiple, 
IHis^,Trp'',Tyr®] -GnRH preprohormone or [Ser^] -GnRH 
preprohormone polypeptides may be expressed in E. coli. As 
another example, these preprohormone polypeptides may be 

35 expressed in fish cells. Other fish proteins such as rainbow 
trout growth hormone have been successfully expressed in E. 
coli (See, e.g., Agellon et al. (1986) DNA 5:463-477 and U.S. 
Patent No, 4,849,359). 
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It is essential to construct expression plasmids 
which contain, at the minimum, a strong promoter to direct 
transcription, a ribosome binding site for translational 
initiation, and a transcription/translation terminator. 
Examples of regulatory regions suitable for this purpose in E. 
coll are the promoter and operator region of the B. coll 
triptophan biosynthetic pathway as described by Yanofsky, €• , 
1984, J. Bacterial., 158:1018-1024 and the leftward promoter 
of phage lambda (Pj^) as described by Herskowitz, I. and Hagen, 
D., 1980, Ann. Rev. Genet., 14:399-445. The inclusion of 
selection markers in DNA vectors transformed in E. coll is 
also useful • Examples of such markers include genes 
specifying resistance to airqpicillin, tetracycline, or 
chloramphenicol, See Sambrook et al. for details concerning 
selection markers for use in E. coll. 

[His^,Trp"^,Tyr®] -GnRH preprohormone and tSer^] -GnRH 
preprohormone polypeptides produced by prokaryotic cells may 
not necessarily fold properly. During purification from E. 
coll, the expressed protein may first be denatured and then 
renatured. This can be accomplished by solubilizing the 
bacterially produced proteins in a chabtropic agent such as 
guanidine HCl and reducing all the cysteine residues with a 
reducing agent such as beta-mercaptoethanol. The protein is 
then renatured, either by slow dialysis or by gel filtration. 
See U,S, Patent No. 4, 511, 503 . 

Detection of the expressed antigen is achieved by 
methods known in the art as radioimmunoassays, or Western 
blotting techniques or immunoprecipitation. Purification from 
JB. coll can be achieved following procedures described in U.S. 
Patent No. 4,511,503. 

2. Expreggion in EukaryntPg 

A variety of eukaryotic expression systems such as 
yeast, insect cell lines, bird, fish, and mammalian cells, are 
known to those of skill in the art. As explained briefly 
below, [His^,Trp'',Tyr®]-GnRH preprohormone or [Ser®] -GnRH 
preprohormone polypeptides may also be e3q>ressed in these 
eukaryotic systems. 
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Synthesis of heterologous proteins in yeast is well 
known and described. Methods in Yeast Genetics, Sherman, F. , 
et al., Cold Spring Harbor Laboratory, (1982) is a well 
recognized work describing the various methods availsUale to 
produce the preprohormone in yeast. 

Suitable vectors usually have expression control 
sequences, such as promoters, including 3 -phosphogly cerate 
kinase or other glycolytic enzymes, and an origin of 
replication, termination sequences and the like as desired. 
For instance, suitable vectors are described in the literature 
(Botstein, et al., 1979, Gene, 8:17-24; Broach, et al., 1979, 
Gene, 8:121-133). 

Two procedures are used in transforming yeast cells. 
In one case, yeast cells are first converted into protoplasts 
using zymolyase, lyticase or glusulase, followed by addition 
of DNA and polyethylene glycol (PEG) . The PEG- treated 
protoplasts are then regenerated in a 3% agar medium under 
selective conditions. Details of this procedure are given in 
the papers by J.D. Beggs, 1978, Nature (London), 275:104-109; 
and Hinnen, A., etal., 1978, Proc. Natl. Acad. Sci. USA, 
75:1929-1933. The second procedure does not involve removal 
Of the cell wall. Instead the cells are treated with lithium 
chloride or acetate and PEG and put on selective plates (Ito, 
H., etal., 1983, J. Sact., 153:163-168). 

tHis^,Trp'',Tyr®l -GnRH preprohormone or [Ser®] -GnRH 
preprohormone polypeptides, once expressed, can. be isolated 
f rcffli yeast by lysing the cells and applying standard protein 
isolation techniques to the lysates. The monitoring of the 
purification process can be accomplished by using Western blot 
techniques or radioimmunoaissays of other standard immunoassay 
techniques . 

The sequences encoding the [His^,Trp'',Tyr®] -GnRH 
preprohormone or [Ser^] -GnRH preprohormone polypeptides can 
also be ligated to various escpression vectors for use in 
transforming cell cultures of, for instance, mammalian, 
insect, bird or fish origin. Illustrative of cell cultures 
useful for the production of the polypeptides are mammalian 
cells. Mammaliem cell systems often will be in the form of 
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monolayers of cells although mammalian cell suspensions may 
also be used. A number of suitable host cell lines capable of 
secreting intact proteins have been developed in the art, and 
include the CHO cell lines, various human cells such as COS 
5 cell lines, HeLa cells, myeloma cell lines, Jurkat cells, etc. 
Expression vectors for these cells can include expression 
control sequences, such as an origin of replication, a 
promoter {e.g., a HSV tk promoter or pgk (phosphoglycerate 
kinase) promoter), an enhancer (Queen et al . (1986) Jjnmunol. 

10 Jeev. 89:49), and necessary processing information sites, such 
as ribosome binding sites, RNA splice sites, polyadenylation 
sites {e.g., an SV40 large T Ag poly A addition site), and 
transcriptional tentiinator sequences. 

Other animal cells useful for production of the 

15 preprohormone are available, for instance ^ from the American 

Type Culture Collection Catalogue of Cell Lines and Hybridomas 
{7th edition, 1992) . For example, fish cells that can be used 
include cells derived from rainbow trout, salmon, and the 
like. Suitable promoters suid vectors are described for 

20 instance, in Friedenreich et al., (1990) Nuc. Acids Res. 
18:3299-3305. 

Appropriate vectors for eaqpressing preprohormone 
polypeptides in insect cells are usually derived from the SF9 
baculovirus. Suitable insect cell lines include mosquito 

25 larvae, silkworm, armyworm, moth and Drosophila. cell lines 

such as a Schneider cell line (See Schneider J. Ewbryol. Exp. 
Moxpbol. 27:353-365 (1987). 

As indicated above, the vector, e.g., a plasmid, 
which is used to transform the host cell, preferably contains 

30 DNA sequences to initiate transcription and sequences to 

control the translation of the preprohormone gene sequence. 
These sequences are referred to as e3q>ression control 
sequences . 

As with yeast, when higher animal host cells are 
35 enrployed, polyadenlyation or transcription terminator 

sequences from known mammalian genes need to be incorporated 
into the vector. An exartple of a terminator sequence is the 
polyadenlyation sequence from the bovine growth hormone gene. 
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Sequences for accurate splicing of the transcript may also be 
included. An example of a splicing sequence is the VPl intron 
from SV40 (Sprague, J. etal., 1983, J. Virol. 45: 773-781) • 
Additionally, gene sequences to control replication 
5 in the host cell may be incorporated into the vector such as 
those found in bovine papilloma virus type- vectors. 
Saveria-Caitipo, M., 1985, "Bovine Papilloma virus DNA a 
Eukaryotic Cloning Vector" in DNA Cloning Vol. II a Practical 
Approach Ed. D.M. Glover, IRL Press, Arlington, Virginia pp. 
10 213-238. 

The host cells are competent or rendered competent 
for transformation by various means. There are several 
well-known methods of introducing DNA into animal cells. 
These include: calcium phosphate precipitation, fusion of the 

15 recipient cells with bacterial protoplasts containing the DNA, 
treatment of the recipient cells with liposomes containing the 
DNA, DEAE dextran, eiectroporation and micro -inject ion of the 
DNA directly into the cells. 

The trauisformed cells are cultured by means well 

20 known in the art. Biochemical Methods in Cell Culture and 
Virology, Kuchler, R.J., Dowden, Hutchinson and Ross, Inc., 
(1977) . The e3q>ressed preprohormone polypeptides are isolated 
from cells grown as suspensions or as monolayers. The latter 
are recovered by well known mechanical, chemical or enzymatic 

25 means. 



D. Purification of rHis^.Trp'^.Tyr^l -Gn Xm preprohormone or 
rser^l -GnRH preprohormone polvpetstideg 

The polypeptides produced by recombineuit DNA 

30 technology may be purified by standard techniG[ues well known 
to those of skill in the art. Recombinantly produced 
preprohormone polypeptides can be directly expres>3d or 
esqpressed as a fusion protein. The protein is then purified 
by a combination of cell lysis (e.gr., sonication) and aixinity 

35 chromatography. For fusion products, subsequent digestion of 
the fusion protein with an appropriate proteolytic enzyme 
release the desired polypeptide. 
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The polypeptides of this invention may be purified 
to substantial purity by standard techniques well known in the 
art, including selective precipitation with such substances as 
ammonium sulfate, column chromatography, immunopurif ication 
methods, and others. See, for instance, R. Scopes, Protein 
Purification: Principles and Practice, Springer -Verlag: New 
York (1982), incorporated herein by reference. 

E. Production of rHi s^^Tjrp'^^TVrQl -GnRH preprohormone or 
[Ser^ l -GnRH preprohormone polypeptides bv protein 
chemistry techniques 

The polypeptides of the invention can be 
synthetically prepared in a wide variety of ways. For 
instance polypeptides of relatively short size, can be 
synthesized in solution or on a solid support in accordance 
with conventional techniques. Various automatic synthesizers 
are commercially available and can be used in accordance with 
known protocols. See, for example, Stewart and Young, Solid 
Phase Peptide Synthesis, 2d. ed., Pierce Chemical Co. (1984). 

^- Modification of nucleic acid and polypeptide aeoruenceg 

The nucleotide sequences used ^ to transfect the host 
cells used for production of recombinant preprohormone 
polypeptides can be modified according to standard techniques 
to yield preprohormone polypeptides with a variety of desired 
properties. The polypeptides of the present invention can be 
readily designed and manufactured utilizing various 
recombinant DNA techniques well known to those skilled in the 
art. For exaiiple, the polypeptides can vary from the 
naturally- occurring sequence at the primary structure level by 
amino acid insertions, substitutions, deletions, and the like. 
These modifications can be used in a number of combinations to 
produce the final modified protein chain. 

The amino acid sequence variants can be prepared 
with various objectives in mind, including facilitating 
purification and preparation of the recombinant polypeptides. 
The modified polypeptides are also useful in, for example, 
modifying plasma half -life, improving therapeutic efficacy. 
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and lessening the severity or occurrence of side effects 
during therapeutic use. The amino acid sequence variants are 
usually predetermined variants not found in nature but exhibit 
the same immunogenic activity as naturally occurring 
polypeptides. For instance, polypeptide fragments comprising 
only a portion (usually at least about 60-80%, typically 90- 
95%) of the primary structure may be produced. In general, 
modifications of the sequences encoding the polypeptides may 
be readily accomplished by a variety of well-known techniques, 
such as site-directed mutageresis (See, Gillman and Smith, 
Gene 8:81-97 (1979) and Roberts, S. et al., Nature 328:731-734 
(1987)) . 

G- Production of Transgenic Non-human Animals 

The invention also encompasses methods and 
polynucleotide constructs which are employed for generating 
trcuisgenic non- human animal? which express the preprohormone 
polypeptides of the invention. The constructs are used to 
produce transgenic non-human mammals, birds {e.g., chickens), 
fish and the like. Incorporation of nucleotide sequences of 
the invention allow modification of the reproductive and other 
characteristics of the transgenic animals. For example, fish 
that do not readily spawn in captivity can be induced to 
reproduce . 

Appropriate constructs and methods for production of 
transgenic animals are known. Typically, the coding sequence 
of interest is operably linked to expression regulatory 
sequences. In such transgenes, the expression regulatory 
sequence is at least the minimal sequences required for 
efficient cell -type specific ea^ression, which generally are 
at least a promoter and sequences upstream of the promoter, 
which provide for efficient e3q)ression in the target cells. 
Usually the sequences upstream of the promoter are used 
contiguously, although various deletions amd rearrangements 
Ccui be employed. Some desired regulatory elements (e.g., 
enhancers, silencers) may be relatively position- insensitive, 
so that the regulatory element will fiinction correctly even if 
positioned differently in a transgene than in the 
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corresponding germline gene. For example, an enhancer may be 
located at a different distance from a promoter, in a 
different orientation, and/or in a different linear order. 
For example, an enhancer that is located 3' to a promoter in 
5 germline configuration might be located 5' to the promoter in 
a trajisgene. 

Typically, expression regulation sequences are 
chosen to produce tissue-specific or cell type-specific 
expression of the recombinant DNA. Once a tissue or cell type 

10 is chosen for expression, expression regulation sequences are 
chosen. Generally, such expression regulation sequences are 
derived from genes that are expressed primarily in the tissue 
or cell type chosen. Preferably, the genes from which these 
expression regulation sequences are obtained are expressed 

15 substantially only in the tissue or cell type chosen, although 
secondary expression in other tissue and/or cell types is 
acceptable if expression of the recoiribinant DNA in the 
transgene in such tissue or cell type is not detrimental to 
the transgenic animal. 

20 Particularly preferred expression regulation 

sequences are those endogenous to the species of animal to be 
msuiipulated. However, expression regulation sequences from 
other species such as those from human genes may also be used. 
In some instances, the expression regulation sequences and the 

25 recombinant DNA sequences (either genomic or cDNA) are from 
the same species. Alternatively, the e^qjression regulation 
sequences and recombinant DNA sequences (either cDNA or 
genomic) are obtained from different species. In such cases, 
the expression regulation and recombinant DNA sequence are 

30 heterologous to each other. 

In certain embodiments, it is desirable to use gene 
targeting, mediated by homologous recombination between a 
targeting polynucleotide construct and a homologous 
chromosomal sequence, to replace an endogenous gene with the 

35 gene encoding a mutant of the preprohormone gene. Methods and 
materials for preparing such constructs are known by those of 
skill in the art and are described in various references. 
See, e.g., Thcanas et al.. Cell 51:503 (1987) and Capecchi, 
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Science 244:1288 (1989). Homologous targeting constructs have 
at least one region having a sequence that substantially 
corresponds to, or is substantially complementary to, a 
predetermined endogenous target gene sequence (e.g., an exon 
sequence, an enhancer, a promoter, an intronic sequence, or a 
flanking sequence of the target gene) . Such a homology region 
serves as a template for homologous pairing and recombination 
with substantially identical endogenous gene sequence (s). In 
the targeting of transgenes, such homology regions typically 
flank the replacement region, which is a region of the 
targeting transgene that is to undergo replacement with the 
targeted endogenous gene sequence. Thus, a segment of the 
targeting transgene flanked by homology regions can replace a 
segment of the endogenous gene sequence by double crossover 
homologous recombination. 

The constructs described above are introduced into 
pluripotent or totipcrent cells using standard techniques. 
Briefly, this technology involves the insertion of the desired* 
transgene construct into an appropriate cell line (e.g-., 
mammalian embryonic stem (ES) cells or fertilized oocytes) 
that is capable of differentiating into germ cell tissue. 
Methods of introducing transgenes into embryonal target cells 
include microinjection of the transgene into the pronuclei of 
fertilized oocytes or nuclei of ES cells of the non- human 
animal. Such methods for murine species are well known to 
those skilled in the art. Alternatively, the transgene may be 
introduced into cui animal by infection of zygotes with a 
retrovirus containing the transgene (Jaenisch, R. (1976) Proc. 
Natl. Acad. Sci. DSA 73:1260-1264). 

The production of transgenic non- human mammals 
(e.g., mice, cows, pigs, and the like) are described in 
International Application No. WO/08216 and Krimpenfort, et 
al.. Biotechnology 9:844-847 (1991). Germ 1 transgenesis 
of both chickens and japanese quail has been -ascribed. For a 
general description of the production oi trajMgenic birds. 
See, Shuman (1991) Experientia 47:897-905. 

In certain preferred embodiments, transgenic fish 
are generated. A variety of exar:$>les of the production of 
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transgenic fish have been reported {See, e.g., Guyranard et al. 
(1989) Biochimle 71:857-863 and Rokkones et al. (1989) J. 
Comp. Physiol. B. 158;751-785. Vectors can be constructed 
using mammalian promoters, e.g., from the murine 
metallothionen gene, or viral promoters, e.g., Rous sarcoma 
virus and simian virus 40 (see, e.g., Stuart et al. (1988) 
Development 103:403-412 and Yoon et al. (1990) Aguaculture 
85:21-33) . Alternatively, vectors comprising entirely piscine 
regulatory sequences can be used (Liu et al. (1990) 
Bio/Teclmology 8:1268-1272). 

The appropriate construct comprising an esqpression 
cassette containing the sequences encoding the preprohormone 
is introduced by microinjection into recently fertilized eggs 
isolated from spawning fish of the appropriate species (e.g., 
salmon, trout, and the like). The injected embryos are reared 
in the appropriate medium and allowed to develop into mature 
fish using standard aquacultural techniques. 

Tissue sanples from the tramsgenic animals can be 
analyzed using standard techniques for detecting the presence 
or absence of a target sequence. For instance. Fluorescent In 
Situ Hybrxdizatioh (FISH) can be used to detect the transgene. 
Several guides to PISH techniques are available, e.g.. Gall et 
al. Meth. Enzymol., 21:470-480 (1981) and Angerer et al. in 
Genetic Engineering: Principles and Methods Setlow and 
Hollaender, Eds. Vol 7, pgs 43-65 (Plenum Press, New York 
1985) . 

The sequences can also be detected by PGR using 
primers and probes specific for the transgene. Standard PGR 
methods useful in the present invention are described in PCR 
Protocols: A Guide to Methods and Applications (Innis et al., 
eds.. Academic Press, San Diego). 

H. Pharmace utical CpmpogifcionB 

The preprohormone polypeptides (e.g., GAP peptides) 
of the present invention can also be administered to control 
reproductive ftmction. Thus, the polypeptides can be prepared 
in pharmaceutical compositions and administered using methods 
well known in the art. For instance, the pharmaceutical 
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compositions can be administered orally with feed for the 
regulation of reproductive function in farm animals such as 
chickens. The compositions can also be suitable for 
parenteral administration. The pharmaceutical compositions 
5 can be administered in a variety of unit dosage forms 

depending upon the method of administration. For example, 
unit dosage forms suitable for oral administration include 
powder, tablets, pills, and capsules. 

Suitable pharmaceutical formulations for use in the 

10 present invention are found in Remington' s Pharmaceutical 

Sciences, Mack Publishing Company, Philadelphia, PA, 17th ed. 
(1985) . A variety of pharmaceutical coirpositions comprising 
•preprohormone polypeptides of the present invention and 
pharmaceutically effective carriers can be prepared. 

15 Pharmaceutical compositions of the invention include 

nucleic acid sequences encoding the preprohormone inserted 
into a suitable gene therapy vector. A variety of different 
gene therapy vectors may be used. Current strategies and 
vectors for gene therapy are reviewed in Miller A.D, (1992) 

20 JJature, 357:455-460 and Mulligan, R.C. (1993) Science 260:926- 
932. The gene therapy vector may be delivered to an 
individual patient, typically by systemic administration 
ie.g., intravenous , intraperitoneal , intramuscular , subdermal , 
or intracranial infusion) . For systemic administration, 

25 injection is preferred, including intramuscular, intravenous, 
intraperitoneal, and subcutaneous injection. Suitable 
formulations for injection are found. in Remington 
Pharmaceutical Sciences, supra. 'The pharmaceutical 
compositions are suitable in a variety of drug delivery 

30 systems. For a brief review of present methods of drug 

delivery. See, Langer, Science 249:1527-1533 (1990) which is 
incorporated herein by reference. 



wo 95/12309 



34 



PCT/US94/12763 



i= DetectXPn Of Nuclfiic acids encoding fHigS .Trp^^.Tyr^l -GnRH 

preprohormone or fSer^l -GnRH preprohormone p o lvpepi^idpg and 
detection of FHi s^.Trp'^.Tvr^l -GnRH preprohormone or fSer^l - 
GnRH preproho rmone polypeptides by immunoassays 

The present invention provides methods for detecting 
DNA or RNA encoding [His^, Trp'^.Tyr®] -GnRH preprohormone or 
[Ser®] -GnRH preprohormone and for measuring these 
preprohormones by immunoassay techniques. These methods are 
useful for two general purposes. First, assays for detection 
Of nucleic acids encoding these preprohormones are important 
for the isolation these nucleic acids from a variety of 
species. As described above, it is known that the mature 
lHis^,Trp'^,Tyr®] -GnRH hormone is present in a wide range of 
species (See Sherwood, N.M. et al., supra). Now that the 
[His^,Trp'',Tyr®] -GnRH preprohormone and tSer®] -GnRH cDNA's 
have been cloned and isolated, nucleic acids encoding these 
preprohormones may be isolated from a variety of species 
according to the methods described in section (B) above and by 
use of the nucleic acid hybridization assays described below. 
The immunoassays described below may be useful for isolation 
of nucleic acids encoding the [His^ , Trp'^ , iyr®] -GnRH or [Ser®] - 
GnRH preprohormones by expression cloning methods. See 
section (B) above and Sambrook, et al. 

The assays described below are also useful as in 
vitro diagnostic assays to determine the reproductive status 
of farm animals or of fish in aquaculture. For example, 
testing of fish for tHis5,Trp*',Tyr®] -GnRH or tSer®] -GnRH 
preprohormone production at various times may play a role in 
the regulation of fish reproduction in aquaculture. In 
addition, the discovery of the lHis5,Trp'',Tyr®] -GnRH mature 
hormone in placental mammals (See Sherwood, N.M. , supra) and 
the demonstration, herein, of the [His5,Trp'^,Tyr®] -GnRH 
preprohormone in the treeshrew suggests that the 
[His^,Trp*',Tyr^] -GnRH preprohormone may also be produced in 
hximans. Therefore, the assays described below may also be 
useful in tests for reproductive function in humans. 
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1. Nucleic Acid Hybridization Assays 

A variety of methods for specific DNA and RNA 
measurement using nucleic acid hybridization techniques are 
known to those of skill in the art. See Sambrook, et al. For 
5 example, one method for evaluating the presence or absence of 
[His^,Trp*^,Tyr®] -GnRH preprohormone DNA or [Ser®] -GnRH 
preprohormone DNA in a sample involves a Southern transfer. 
Briefly, the digested genomic DNA is run on agarose slab gels 
in buffer and transferred to membranes. Hybridization is 

10 carried out using the nucleic acid probes discussed above. As 
described above, nucleic acid probes are designed based on the 
known nucleic acid sequences encoding the [His^,Trp'',Tyr®] - 
GnRH preprohormone or the [Ser®] -GnRH preprohormone. 
Visualization of the hybridized portions allows the 

15 qualitative determination of the presence or absence of DNA 
encoding [His^, /rp'^^Tyr®] -GnRH preprohormone or the [Ser®] - 
GnRH preprohormone. 

Similarly, a Northern transfer may be used for the 
detection of mRNA encoding [His^,Trp'^,Tyr®] -GnRH preprohormone 

20 or the [Ser®] -GnRH preprohormone. In brief, the mRNA is 

isolated from a given cell sample using an acid guanidinitam- 
phenol -chloroform extraction method. The mRNA is then 
electrophoresed to separate the mRNA species and the mRNA is 
transferred from the gel to a nitrocellulose membrane. As 

25 with the Southern blots, labeled probes are used to identify 
the presence or absence of the [His^,Trp'',Tyr®] -GnRH or the 
[Ser®] -GnRH preprohormone trauiscript. 

A variety of nucleic acid hybridization formats are 
known to those skilled in the art. For example, common 

30 formats include sandwich assays and competition or 

displacement assays. Hybridization techniques are generally 
described in "Nucleic Acid Hybridization, A Prstctical 
Approach, ^ Ed. Hames, B.D. and Higgins, S.J., IRL Press, 1985; 
Gall and Pardue (1969), Proc. Natl. Acad. Sci., U.S.A., 

35 63:378-383; eind John, Bumsteil and Jones (1969) Nature, 
223:582-587. 

For example, sandwich assays are commercially useful 
hybridization assays for detecting or isolating nucleic acid 
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sequences. Such assays utilize a "capture" nucleic acid 
covalently immobilized to a solid support and labelled 
"signal" nucleic acid in solution. The clinical sample will 
provide the target nucleic acid. The "capture" nucleic acid 
5 and "signal" nucleic acid probe hybridize with the target 

nucleic acid to form a "sandwich" hybridization complex. To 
be effective, the signal nucleic acid cannot hybridize with 
the capture nucleic acid. 

Typically labelled signal nucleic acids are used to 

10 detect hybridization. Complementary nucleic acids or signal 
nucleic acids may be labelled by any one of several methods 
typically used to detect the presence of hybridized 
polynucleotides- The most common method of detection is the 
use of autoradiography with ^H, ^^^l, ^^s, "c, or ^^P-labelled 

15 probes or the like. Other labels include ligands which bind 
to labelled antibodies, f luorophores , chemiluminescent agents, 
enzymes, and antibodies which can serve as specific binding 
pair members for a labelled ligand. 

Detection of a hybridization conplex may require the 

20 binding of a signal generating complex to a duplex of target 
and prbbe polynucleotides or nucleic acids . Typically, such 
binding occurs through ligand and anti- ligand interactions as 
between a ligemd- conjugated probe and an anti-ligcmd 
conjugated with a signal. 

25 The label may also allow indirect detection of the 

hybridization complex. For example, where the laJDel is a 
hapten or antigen, the sanple can be detected by using 
antibodies. In these systems, a signal is generated by 
attaching fluorescent or enzyme molecules to the antibodies or 

30 in some cases, by attachment to a radioactive label. 

(Tijssen, P., "Practice and Theory of Enzyme Immunoassays," 
Laboratory Teclinigues in Biochemistry and Molecular Biology, 
Burdon, R.H., van Knippenberg, P.H., Eds., Elsevier (1985), 
pp. 9-20.) 

35 The sensitivity of the hybridization assays may be 

enhamced through, use of a nucleic acid antplif ication system 
which multiplies the target nucleic acid being detected. 
Exait?)les of such systems include the polymerase chain reaction 
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(PGR) system and the ligase chain reaction (LCR) system. 
Other methods recently described in the art are the nucleic 
acid sequence based anrplif ication (NASBA™, Cangene, 
Mississauga, Ontario) and Q Beta Replicase systems. 

An alternative means for determining the level of 
expression of the [His^,Trp'^,Tyr®l -GnRH preprohormone gene or 
the [Ser®] -GnRH preprohormone is in situ hybridization. In 
situ hybridization assays are well known and are generally 
described in Angerer, et al.. Methods Enzymol., 152:649-660 

(1987) . In an in situ hybridization assay, cells, 
preferentially bovine lymphocytes are fixed to a solid 
support, typically a glass slide. If DNA is to be probed, the 
cells are denatured with heat or alkali. The cells are then 
c: atacted with a hybridization solution at a moderate 
temperature to permit annealing of labeled probes pecific to 

tHis^,Trp'^,Tyr®3 -GnRH preprohormone or the [Ser®] -GnRH 
preprohormone. The probes are preferably labelled with 
radioisotopes or fluorescent reporters . 

2^ Production of Antibodies and Development of 
Immunoassays 

In addition to detecting e3qpression of the 
[His^^Trp'',Tyr®] -GnRH preprohormone or the [Ser®] -GnRH 
preprohormone by nucleic acid hybridization, one can also use 
immunoassays to detect the preprohormones . Immunoassays can 
be used to qualitatively or quantitatively analyze for the 
preprohormone or to specifically detect the GAP peptide. A 
general overview of the applicable technology can be f oxind in 
Harlow and Lane, Antibodies: A Laboratory Manual, Cold Spring 
Harbor Pubs., N.Y. (1988), incorporated herein by reference. 

a. Antibody Production 

A number of immunogens may be used to produce 
antibodies specifically reactive with the lHis^,Trp'^,Tyr®] • 
GnRH preprohormone or the [Ser®] -GnRH preprohormone. 
Recombinajit preprohormone is the preferred immunpgen for the 
production of monoclonal or polyclonal antibodies. Naturally 
occurring preprohormone may also be used either in pure or 
impure form. Synthetic peptides made using the 
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[His5,Trp"^,Tyr®] -GnRH preprohorxnone or the [Ser®] -GnRH 
preprohonnone sequences described herein may also used as an 

immunogen for the production of antibodies to the 
preprohonnone. Preferentially, recombinant [His^,Trp'^,Tyr®] - 
GnRH preprohormone or [Ser®] -GnRH preprohonnone or fragments 
thereof are expressed in bacterial cells as described above, 
and purified as generally described above. The product is 
then injected into an animal capable of producing antibodies. 
Either monoclonal or polyclonal antibodies may be generated, 
for subsequent use in immunoassays to measure the 
preprohormone . 

Methods of production of polyclonal antibodies are 
known to those of skill in the art. In brief, an immunogen, 
preferably a purified protein, is mixed with an adjuvant and 
animals are immunized. The animal's immune response to the 
immunogen preparation is monitored by taking test bleeds and 
determining the titer of reactivity to the [His^,Trp'^,Tyr®] - 
GnRH preprohonnone or the tSer®] -GnRH preprohormone. When 
appropriately high titers of antibody to the immunogen are 
obtained, blood is collected from the animal and antisera is 
prepared. Further fractionation of the antisera to enrich for 
antibodies reactive to the preprohormone can be done if 
desired. (See Harlow and Lane, supra) . 

Monoclonal antibodies may be obtained by various 
techniques familiar to those skilled in the art. Briefly, 
spleen cells from an animal immunized with a desired antigen 
are immortalized, commonly by fusion with a myeloma cell (See, 
Kohl er and Mil stein, Eur. J. Immunol. 6:511-519 (1976), 
incorporated herein by reference) . Alternative methods of 
immortalization include transformation with Epstein Barr 
Virus, oncogenes, or retroviruses, or other methods well known 
in the art. Colonies arising from single immortalized cells 
are screened for production of antibodies of the desired 
specificity and affinity for the antigen, and yield of the 
monoclonal antibodies produced by such cells may be enhanced 
by various techniques, including injection into the peritoneal 
cavity of a vertebrate host. 
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Methods of production of synthetic peptides are 
known to those of skill .\ the art. Briefly, the predicted 
immunogenic regions of 3^,Trp'^,Tyr®] -GnRH preprohormone or 
[Ser®] -GnRH preprohormozia sequences described herein are 
5 identified. For production of antibodies specific to the 
[His^^Trp'^^Tyr®] -GnRH or [Ser®] -GnRH preprohormone GAP 
peptides, predicted iiran; logenic regions of the preprohormone 
sequence in the GAP region of the molecule are used (see Seq, 
ID No. 2 , Seq ID No. 18 and Seq. ID No 20. Peptides 

10 preferably at least 10 amino acids in length are synthesized 

corresponding to these regions and the peptides are conjugated 
to larger protein molecules for svibsequent immunization. 
Production of monoclonal or polyclonal antibodies is then 
carried out as described above. 

15 b. Immunoassays 

Antibodies reactive with a particular protein can be 
measured by a variety of immunoassay methods . For a review of 
immunological and immunoassay procedures in general. See Basic 
and Clinical Immnology 7th Edition (D. Stites and A. Terr 

20 ed.) 1991. Moreover, the immunoassays of the present 

invention can be performed in any of several configurations, 
which are reviewed extensively in Enzyme Immunoassay, E.T. 
M^ggio, ed., CRC Press, Boca Raton, Florida (1980); "Practice 
and Theory of Enzyme Immunoassays," P. Tijssen, Laboratoxy 

25 redmlgues in Biochemistry and Molecular Biology, Elsevier 
Science Publishers B.V. Amsterdam (1985) ; and, Harlow and 
Lane, AntiJbociies, A Laboratory Manual, supra, each of which is 
incorporated herein by reference. 

Immunoassays for measurement of [His^,Trp'',Tyr®] - 

30 GnRH preprohormone or [Ser®] -GnRH preprohormone polypeptides 
Ccm be performed by a variety to methods known to those 
skilled in the art. In brief, immunoassays to measure the 
preprohormones or t> GAP peptides can be either conqpet itive 
or noncompetitive binding assays. In competitive binding 

35 assays, the Scunple analyte competes with a laibeled analyte for 
specific binding sites on a capture agent bound to a solid 
surface. Preferably the capture agent is an antibody 
specifically reactive with the tHis^,Trp'^,Tyr®] -GnRH 
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preprohormone or [Ser®] -GnRH preprohormone produced as 
described ahove. The concentration of labeled analyte bound 
to the capture agent is inversely proportional to the amount 
of free analyte present in the sample • 
5 In a competitive binding immunoassay, 

[His^^Trp'^^Tyr®] -GnRH preprohormone or [Ser®] -GnRH 
preprohormone present in the sample competes with labelled 
preprohormone for binding to a specific binding agent, for 
example, an antibody specifically reactive with the 

10 preprohormone. The binding agent may be bound to a solid 

surface to effect separation of bound labelled preprohormone 
from the unbound labelled preprohormone. Alternately the 
coitpetitive binding assay may be conducted in liquid phase and 
any of a variety of techniques known in the art may be used to 

15 separate the bound labelled preprohormone from the unbound 

labelled preprohormone. Following separation, the amoxxnt of 
bound labeled preprohormone is determined. The amount of 
preprohormone present in the sample is inversely proportional 
to the amount of labelled preprohormone binding. 

20 Alternatively, a homogenous immunoassay may be 

performed in which a separation step is not needed. In these 
immunoassays, the label on the preprohormone is altered by the 
binding of the preprohormone to its specific binding agent. 
This alteration in the labelled protein results in a decrease 

25 or increase in the signal emitted by label, so that 

measurement of the label at the end of the immunoassay allows 
for detection or quantitation of the preprohormone. 

[His^,Trp'',Tyr®] -GnRH preprohormone or [Ser®] -GnRH 
preprohormone may also be determined by a variety of 

30 noncompetitive immunoassay methods. For exairple, a two-site, 
solid phase sandwich immunoassay is used. In this type of 
assay, a binding agent for the preprohormone, for example an 
amtibody, is attached to a solid phase. A second 
preprohormone binding agent, which may also be an antibody, 

35 and which binds the preprohormone at a different site, is 

labelled. After binding at both sites on the preprohormone 
has occurred, the unbound labelled binding agent is removed 
and the amount of labelled binding agent bound to the solid 
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phase is measured. The amount of labelled binding agent bound 
is directly proportional to the amount of preprohormone in the 
sample . 

Western blot analysis can also be done to determine 
the presence of [His^,Trp*^,Tyr®] -GnRH or [Ser®] -GnRH 
preprohormone in a sample. Electrophoresis is carried out, 
for example, on a tissue sample suspected of containing the 
preprohormone. Following electrophoresis to separate the 
proteins, and transfer of the proteins to a suitable solid 
support such as a nitrocellulose filter, the solid support is 
then incubated with an antibody reactive with the 
preprohormone. This antibody may be labeled, or alternatively 
may be it may be detected by sxibsequent incubation with a 
second labelled antibody that binds the anti- preprohormone 
antibody. 

The immunoassay formats described above employ 
lcd:>elled assay con^^onents. The label can be in a variety of 
forms. The label may be coupled directly or indirectly to the 
desired component of the assay according to methods well known 
in the art. A wide variety of labels may be used. The 
component may be labeled by any one of several methods. 
Traditionally a radioactive label incorporating ^h, 125 35g^ 
"c, or used. Non- radioactive labels include ligands 

which bind to labelled antibodies, f luorophores , 
chemiluminescent agents, enzymes, and antibodies which can 
serve as specific binding pair members for a labelled ligand. 
The choice of label depends on sensitivity required, ease of 
conjugation with the compound, stability requirements, and 
available instrumentation. For a review of various labelling 
or signal producing systems which may be used, see U.S. Patent 
No. 4,391,904, which is incorporated herein by reference. 

This invention also embraces diagnostic kits for 
detecting the presence of [His^, Trp'',Tyr®] -GnRH preprohormone 
or [Ser®3 -GnRH preprohormone in tissue or blood samples which 
comprise a container containing antibodies selectively 
immunoreactive to the preprohoimone and instructional material 
for performing the test. This invention further embraces 
diagnostic kits for detecting IHis5,Trp'',Tyr®] -GnRH or [Ser®] • 
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GnRH preprohormone DNA or RNA in tissue or blood samples which 
comprise nucleic probes as described herein and instructional 
material . 
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EXAMPLES 

Example l; Isolation of a cDNA encoding H. burton! 
[pis5 , Trp"^ , Tyr^ 1 ' QnRH prepronormone 



5 hi Development of an ol.itTonucleotide probe specific for 

H. burt oni THis^ , Trp"^ .Tyr^l -GnRH preprohormone cDNA 
To Obtain nucleotide sequences encoding a 
[His5,Trp'^,Tyr®] -GnRH-like peptide from H. Jburtoni, we 
employed a novel polymerase chain reaction (PCR) strategy 

10 using brain regions known to contain GnRH neuronal populations 
(see Pig, 4 for a schematic view of H. burtonl brain) • 
Animals were sacrificed and whole brains removed as described 
in White, S.A. & Femald, R.D. J. Neurosci. (1993) 13:434-441. 
The dorsal portion of the telencephalon, the optic tectum and 

15 cerebellum from each brain was removed. Total RNA wais 

isolated from the remaining ventral portions by guanidine 
thiocyanate-acid phenol extraction and converted into cDNA 
using reverse transcriptase (Superscript, BRL; Gaithesburg, 
MD) . cDNA synthesis was primed with 0.5 /ig of a bipartite 

20 oligonucleotide consisting of a homopolymer of 9 d(T) residues 
at the 3' end. The 5' domain comprised a sequence of 14 
nucleotides which included a restriction endonuclease 
recognition site. The sequence of the oligonucleotide is: 5' 
GCAGAAGCTTCAGCT(9) 3' (Seq. ID No. 15). 

25 The resulting cDNA was used as substrate for nested 

PCR. In both rounds of amplification, the downstream primer 
was equivalent to the 5* domain of the bipartite 
oligonucleotide, described above. In the first PCR reaction, 
the upstream primer was a pool of 24 14-mers representing all 

30 possible coding sequences for the first five amino acids of 
[His^^Trp'^^Tyr^] -GnRH (see figure Kb)). 12.5 janol of the 
upstream primer was used. After 40 cycles of 94*'C/30 s; 
55^C/30s; 72«^C/15s, 0.05 fil of the product of this reaction 
was aliquoted to a second reaction using 12.5 pmol of a new 

35 upstream primer together with the same downstream primer under 
the same cycling conditions. The new primer pool of 24 16- 
mers consisted of all possible coding combinations for 
residues 5-9 of the decapeptide (see figure 1(b)). Reactions 
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were performed using Taq polymerase (synthesized courtesy of 
Dr. R. Moses) on a Perkin-Elmer 9600 thennocycler . 
Oligonucleotides were synthesized as needed using an implied 
Biosystems 391 instrument. 

PGR products were separated using a 1.5% GTG agarose 
gel (EMC, Rockville, ME, USA) and products of greater than 300 
bases were electrocuted and subcloned into M13 for sequencing 
as described in Bond, C.T., Francis, R.C., Pemald, R.D. & 
Adelman, J. p. (1991) J. Mol. Endocrinol. 5:931-937. 
Nucleotide sequences were analyzed for the presence of the 
second upstream primer (ending in the codon for amino acid 9; 
see Fig. 2), followed by a codon for the final conserved amino 
acid of the decapeptide, ^°Gly, and sequences for the 
canonical GnRH amidation and peptide processing site, 
iiQlyi2Lysi3;^g^ (See Douglass, J., Civelli, 0. & Herbert, E. 
(1984) Ann. Rev. Biochem. 53:665-715.) 

A radioledaeled nucleotide probe 29 bases in length 
was derived from a putative [His5,Trp'',Tyr8] -GnRH PGR product. 
The sequence of this oligonucleotide is 
GGGAATGCAGGTACGTGAGAGCGCAGAGG (Seq. ID No. 16) . 



fil Isolation of a full -length cDNA enrndino ^hg W. 
bWpni rHisS.Trp^Tvrgl-gnRH n^^ ^^T,..^^^^ 
To obtain full-length coding sequences, poly-adenylated 
RNA was isolated from H. burtoni ventral brain regions (see 
above) and used to construct a cDNA library in XgtlO. 250,000 
primary recombinant were screened (Genescreen; NEN-Dupont, 
Boston, MA, USA) using the radiolabeled oligonucleotide of Seq 
ID No. 16, described above. Positively hybridizing phage were 
purified by successive rescreens at reduced density. The cDNA 
inserts were subcloned into M13 phage and the nucleotide 
sequences determined. A cDNA containing the full-length 
coding sequence was subcloned into a transcription vector 
(Pselect, USB) for generation of sense and antisense 
riboprobes . 
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Pxamp3.g 2; Sequence Analysis of the Jf. hurtoni cDNA encoding 

the rHis^,Trp'^,Ty r^1 -GnRH preprohormone 

Six positively hybridizing clones were identified 
from the H. Jburtoni ventral brain cDNA library as described 
5 above and the nucleotide sequences of the inserts were 

determined. The full-length cDNA sequence is shown _n Seq. ID 
No. 1, and the predicted amino acid sequence for the H. 
burtoni chicken II GnRH preprohormone is shown in figure 2 
(Seq. ID No. 2) . The longest open reading frame containing 

10 the [His^,Trp'',Tyr®] -GnRH coding sequence begins with a 
methionine initiator codon and predicts a 90 amino acid 
protein (Fig. 2) . The first 23 residues are largely 
hydrophobic and are likely a signal peptide as is found in 
many polyprotein neuroendocrine precursors (see Douglass, J., 

15 Civelli, 0. & Herbert, E. (1984) Ann. Rev. Blochem. 

53:665-715.) The signal sequence is in direct li kage with 
[His^,Trp'',Tyr®] -GnRH, which is followed by Gly^^Lys^^Arg" . 
These residues follow the decapeptide sequence in all of the 
other cloned GnRH preprohormones and serve as substrates for 

20 post-translational processing (See Douglass, J., et al., 

supra). The amino acid sequence of the [His^,Trp'^,Tyr®] -GnRH 
preprohormone (Seq. ID No. 3) is shown for comparison in 
Figure 2. The remainder of the precursor may code for two 
additional peptides also generated by proteolytic processing, 

25 neither of which show any homology to sequences in protein 

databases. One peptide, 28 residues in length, is followed by 
dibasic residues (arg-arg, see Fig. 2 legend), and the 
remaining 19 amino acids could comprise a second processing 
product. There is only about 10% identity between the two H. 

30 Jburtoni GnRH preprohormones when the GnRH motif is excluded 
(see Figure 2) . 

Example 3; Northern blot analysis of K, burtoni brain tissue 
for the F HisS.Trp'^.Tvr^l -GnRH Precursor 
35 Northern blots were prepared as previously described 

(see Bond, C.T. , Francis, R.C, Fernald, R.D. & Adelman, J. P. 
(1991) «7. tfol. Endocrinol. 5:931-937) and probed with 
antisense riboprobes synthesized with SP6 RNA polymerase, 
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incorporating ^^P- labelled UTP to a specific activity of 3 x 
10® dpia//ig* Hybridizations contained approximately 10*^ dpm 
per ml^ High stringency conditions were obtained using 50% 
formamide at 65«*C with washes in 0.1 x SSC at 70«*C. 
5 Northern blot analysis of total RNA extracted from 

dorsal, ventral and whole H. burton! brain revealed a single 
transcript of ca. 530 bases expressed in ventral and whole 
brain but absent from dorsal structures (Pig. 3) 
Radiolabelled antisense riboprobes for both the 

10 [His^,Trp'^,Tyr®] -GnRH and [Trp'', Leu®] -GnRH preprohormones were 
prepared and used to probe Northern blots of polyA-i- RNA 
isolated from total brain at both high and low stringencies. 
The results (not shown) indicate that the probes for the two 
preprohormones do not cross -hybridize at either stringency 

15 with the alternative transcript. 

Example 4: In situ hybridization 

The site of [His^,Trp"',Tyr®l -GnRH mRNA production in 
H. burtoni brain was assessed using in situ hybridization. 

20 Animals were sacrificed, brains removed and tissue fixed as 
previously described (See White , et al. / t7. Neurosci. (1993) 
13:434-44). 40 fm cryostat sections were mounted on poly 
L- lysine subbed slides for hybridization to the digoxygenin 
UTP (DIG, Boehringer Mannheim) -labeled [His^,Trp'',Tyr®] -GnRH 

25 riboprobe. Hybridizations were carried out at high stringency 
(55*^0 in 50% formamide) according to the manufacturer's 
instructions with the following modifications: 150 fil of 
hybridization solution was applied to slides containing 4 to 5 
brain sections. Following hybridization, 3 washes were done 

30 in 2 X SSC at 55*^0 for one hour each, followed by one wash in 
1 X SSC for 30 min at room temperature. To prepare the 
riboprobe , subcloned prepro (His^ , Trp"^ , Tyr® ] - GnRH was 
linearized with Eco RI (GIBCO-BRL, Grand Island, New York, 
USA) , followed by filling in of recessed 3 • termini with the 

35 Klenow fragment of DNA polymerase I and Proteinase K 

treatment. The template DNA was purified using Magic DNA 
Clean- Up System (Promega Corporation, Madison, Wisconsin, USA) 
cuad 300 ng was used for transcription of a riboprobe with SP6 
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RNA polymerase (GIBCO-BRL) in the presence of DIG -conjugated 
UTP, Final concentrations in a lOO^l reaction volume were: l 
X SP6 reaction buffer, 1 X NTP labeling mixture 
(Boehringer-Mannheim, Indianapolis, Indiana ; USA), 4.0 mM DTT, 
80 units RNAsin (Promega) , 13-2 tig BSA, and 120 units of SP6 
polymerase. Solutions for both Northern analysis and in situ 
hybridizations were prepared using water treated with 0.075% 
diethylpyrocarbonate . 

[His^,Trp'^,Tyr^] -GnRH mRNA expression, localized 
using in situ hybridization^ was found only in a cluster on 
neurons foxind in the mesencephalon (Fig. 4 (TOP)) previously 
shown to contain GnRH using immunocytochemistry. Davis, M.R. 
& Femald, R.D., 47. Neurobiol. (1990) 21:1180-1188. 
Interestingly, [His^,Trp'^,Tyr®] -GnRH is not expressed in the 
hypothalamic nucleus (Fig. 4 (LEFT COLUMN)) which projects to 
the pituitary and hence probably does not directly influence 
gonadotropin activation. The probes for the two species of 
GnRH found in H. Jburtoni do not cross -react either when 
applied in situ (see Fig. 4) or in Northern blots (see above) 
Thus distinct forms of GnRH encoding genes are expressed in 
distinct brain regions. 

In burtoni, as in other vertebrates, three 
neuronal populations have been shown to contain GnRH using 
immunocytochemistry: the terminal nerve, the 

hypothalamic/preoptic area and the mesencephalon. Davis, M.R. 
& Femald, R.D., J. Neurobiol. (1990) 21:1180-1188; White, 
S.A. & Femald, R.D. , J. Neurosci. (1993) 13:434-441; see Fig. 
4, TOP). Here we have shown that lHis^,Trp'^,Tyr®] -GnRH mRNA 
is esqpressed only in the mesencephalic population (Fig. 4, 
(LEFT COLUMN)) . In contrast, [Trp*^, Leu®] -GnRH mRNA is 
localized only in the terminal nerve nucleus of the 
telencephalon (Fig. 4 (RIGHT COLUMN)). Furthermore, the lack 
of in situ hybridization within the GnRH-immunoreactive cells 
of the hypothalamic/preoptic area (Fig. 4) reveals that the 
GnRH-encoding gene expressed there is sufficiently different 
to elude detection by either probe. Thus, despite its potent 
releasing factor activity (cf. Ngamvongchon, S. Rivier, J.E. & 
Sherwood, N.M. (1992), Reflrul. Pept. 42:63-73)), 
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[His5,Trp'^,Tyr®] -GnRH is unlikely to influence pituitary 
function directly in H. burton! . Moreover, this suggests that 
yet a third gene encoding for GnRH may be responsible for 
regulating reproduction directly. 
5 The structure of the [His^, Trp'^^Tyr®] -GnRH predicts 

two novel peptides in addition to it, in contrast to the 
single associated peptide predicted by [Trp"^, Leu®] -GnRH inRNA 
(see Figure 2) , The lack of homology between the two known 
GnRH preprohormones in burtoni suggests that these genes 

10 have either evolved independently or diverged from a common 
ancestor well before the appearance of teleost fish. 

The novel preprohormone structure and 
extrahypothalamic location of the H. burtoni [His^,Trp'^,Tyr®] - 
GnRH precursor may indicate a unique function for this GnRH 

15 form and/or its associated peptides. While hypothalamic GnRH 
clearly functions as a releasing hormone, neuromodulatory and 
neurotransmitter roles have been postulated for GnRHs in other 
brain regions (reviewed in Louroaye, E. , Thomer, J. & Catt, 
K.J. (1982) Science 218:1323-1325). The localization of the 

20 [His^,Trp'^,Tyr®l -GnRH mRNA to the mesencephalon supports 

previous imuhologlcal studies in which (His^,Trp*^,Tyr®] -GnRH- 
like peptides were found concentrated in caudal brain areas. 
Such mesencephalic GnRH- containing neurons may play a role in 
the central organization of reproductive behavior as suggested 

25 for poecilid fish where a similar neuronal population has been 
described. See Miller, K.E. & Kriebel, R.M. (1986) Gen. Comp. 
Endocrinol. 64:396-400. These cells contain GnRH and project 
to a spinal neurosecretory group which, in turn, has been 
postulated to play a role in gonadal duct contractility. 

30 As noted, only one GnRH peptide (previously called 

mammalian GnRH) , that which regulates the pituitary 
gonadotropes, has been demonstrated in more recently evolved 
mammals. This form can be detected in some bony fish, but is 
absent in H. burtoni as well as in reptiles and birds 

35 (reviewed in Loumaye, E., Thomer, J. & Catt, K.J. (1982) 

Science 218:1323-1325, supra). In contrast, [His^^Trp'^^Tyr®] - 
GnRH has widespread expression among vertebrates with the 
notable exception of more derived mammals. Since non- 
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mammalian animals express more than one GnRH peptide, it is 
possible that many mammals may express more than one form of 
GnRH, particularly the [Ser®] -GnRH form. 

5 Example 5: Isolation of a cDNA encoding a treeshrew 
[His^-Trp'^.Tyr^l -GnRH preprohormone 

To isolate the sequence encoding the 
[His^,Trp'^,Tyr®3 -GnRH preprohormone isoform in the treeshrew 
Tupaia bulangeri, we used a polymerase chain reaction (PGR) 

10 strategy similar to that used in the isolation of this isoform 
in a teleost fish, Haplochromis burton! , as described Example 
1, herein. The amino acid sequence of the [^His,'^Trp, 
®Try]GnRH decapeptide guided selection of primers for PGR 
amplification. Treeshrew brain mRNA was primed with a 

15 bipartite oligonucleotide for cDNA synthesis. The 3' domain 
of this oligonucleotide consisted of a homopolymer of d{T)24 
residue, while the 5' domain comprised a known sequence of 10 
nucleotides which included a restriction endonuclease 
recognition site. The resulting cDNA was used as substrate 

20 for nested PC31. The downs treaun primer in both rounds of 

amplification was bipartite, A randomly generated hexamer 
including all possible combinations of nucleotides at seven 
positions composed the 3' end and ten known nucleotides, 
including an Hind III restriction endonuclease recognition 

25 site, made up the 5* end. In the first PGR reaction, the 

upstream primer was a pool of 24 tetradecamers representing 
all possible coding sequences for the first five amino acids 
of [His5,TiTp*',Tyr®] -GnRH. The sequence of these primers is 
represented in the following two sequences : GCACXSAATTCCA A/G 

30 CA T/C TGGTCNCA (Seq. ID No. 21) and GCACGAATTCCA A/6 CA T/C 
TGGAG T/C CA (Seq. ID No. 22) . The product of the first PGR 
reaction served as substrate for nested: reactions where the 
new upstream primer was bipartite. The 5' end of this primer 
contained a known restriction enzyme recognition sequence 

35 while the 3' end con5)rised a pool of 24 tetradecamers 

representing all possible coding combinations for residues 5-8 
of the decapeptide. These sequence this pool of primers is: 
GCACGAATTCCA T/C GGNTGGTA T/C CC (Seq. ID No. 23) . 
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PGR products were separated by electrophoresis in a 
2.5% GTG agarose gel (FMC) and products > 250 bases 
electrocuted and siibcloned into M13 phage vector for sequence 
analysis (Bond et aJ,, 1991). Nucleotide sequences were 
analyzed for the presence of the second upstream primer 
followed by the codona for the final amino acid of the 
decapeptide, Gly, and the next three amino acids Gly, Lys, and 
Arg. These residues follow the decapeptide sequence in the 
other cloned GnRH preprohormones and serve as substrates for 
posttranslational processing. One clone contained these 
landinarks followed by an open reading frame of 165 (or 219) 
nucleotides 55 (or 73) amino acids. The numbers in 
parentheses refer to the nucleotide and amino acid length if 
the alternate codon using methionine is used as the start 
codon. To confirm the identity of this clone as the 
tHis^,Trp'',Tyr®] -GnRH preprohormone , a downstream region of 
the clone was chosen from which to generate new primers for 5 ' 
rapid amplification of cDNA ends (RACE). See Prohman et al. 
in (1993) Methods in Enzymology Volvme 218, Wu, R. , editor. 
Academic Press, Inc., San Diego, California, USA for a more 
detailed des cript ion of RACE methodology . 

Substrate cDNA for the 5' RACE was synthesized from 
mRNA primed with randomly generated hexamers including all 
possible combinations of nucleotides at 7 positions. These 
cDNA products were tailed at their 5' ends with dATP in a 
terminal deoxytransf erase reaction (GIBCO BRL, Grand Island, 
New York, USA) . Two rounds of PGR amplification were 
performed on the tailed cDNA in which the 3' antisense primer 
consisted of a bipartite oligonucleotide (poly d(T)l7 plus 
Hind III recognition site) designed to sumeal to the 5' 
terminal poly d(A)s. The oiiter 5' antisense primer for the 
first round of PCR consisted of a septadecamer corresponding 
to a region 111-127 nucleotides downstream of the decapeptide 
region. The oligonucleotide sequence for the outer primer was 
CAGGGCACaiCTGTCCTC (Seq. ID No. 24) . For the second round of 
anplification, the inner 5' antisense primer was bipartite, 
composed of a known sequence of ten nucleotides including a 
restriction endonuclease cleavage site plus another 17 
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nucleotides 72-88 nucleotides downstream of the decapeptide 
region of the preprohormone candidate. The oligonucleotide 
sequence for the inner primer was GCACGAATTCCGAAGCTATGAGCAGTC 
(Seq. ID No. 25) • PGR products were prepared for sequencing 
5 (as above) and screened for the presence of the innermost 
primer followed by antisense sequences from the candidate 
clone and by antisense codons for the GnRH decapeptide. A 
majority of the generated clones were positive for the 
[His^,Trp'',Tyr®] -GnRH encoding region and extended 
10 approximately 135 more nucleotides before ending in the poly 
d(T)/Hind III primer- sequence. Only one clone was isolated 
containing preprohormone sequence 3' to that region amplified 
by 5' RACE. 

3-5 Example 6: Sequence Analvai s of the treeshrew cDNA encoding 

the THis^ , Trp*^ . Tyr^ 1 -GnRH preprohormone 

Only one clone was isolated in Example 5 which 
contained preprohormone sec[uence 3' to that region amplified 
by 5' RACE. To confirm the sequence, we designed a 

20 nondegenerate bipartite oligonucleotide whose 5' end consisted 
of ten nucleotides including a restriction endonuclease 
cleavage site and whose 3' end encoded residues 3-8 of the 
decapeptide region. The sequence of the oligonucleotide was 
GCACGAATTC6GTCCCACGGCTGGTAC (Seq. ID No. 26). Poly d(T) 

25 primed cDNA was used as substrate in PGR aitqplif ication between 
the above oligonucleotide and a bipartite poly d(T)/Hind III 
primer downstream. Products were prepared, sec[uenced and 
screened as before resulting in many positive clones 
containing the same sequence as that found in the primary 

30 downstream clone and ending in polyadenylated tails. 

The fullest secpience of this preprohormone was 
obtained by aligning overlapping regions from at least ten 
consensus 5' RACE templates with at least ten consensus . 
sequences generated in the first and second 3 ' PGR 

35 aitplifications. Sequences were aligned (Color Alignment 

Macros, M. Haygood) and differences attributed to PCR-error 
were assessed. The deduced amino acid sequence is shown in 
Seq. ID No. 18. As with GnRH preprohormones previously 
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Isolated, a signal sequence precedes the decapeptide, followed 
by a conserved proteolytic processing site and an associated 
peptide. (See Cowparative Biology and Evolutionaxy 
Relationships of Treeshrews, W.P. Luckett, ed. , Plenum Press, 
5 New York (19 80).) Treeshrews are part of the super order 
Archonta, which includes primates, flying lemurs, bats and 
elephant shrews. Treeshrews are thus very closely related to 
primates, yet primitive. As a consequence, they are very 
frequently used to discover information about molecules, 
10 processes, etc., which may link much more primitive rodents 

with primates. Thus, [Ser^] -GnRH preprohormone may be present 
in primates including humans . 

Example 7: Isolation of a cDNA encoding H. burtoni TSer^l -GnRH 

15 preprohormone 

To obtain nucleotide sequences encoding a [Ser^] - 
GnRH*like peptide from H. burtoni, a novel polymerase chain 
reaction (PGR) strategy using brain regions known to contain 
GnRH neuronal populations was exnployed (see Fig. 4 for a 

20 schematic view of H. burtoni brain) . Animals were sacrificed 
cuid wliole brains removed as described in White/ S. A. & 
Femald, R.D. J. Neurosci. (1993) 13:434-441. H. burtoni mRMA 
was isolated from the preoptic area, brain tissue known to 
contain the GnRH neuronal population which projects to the 

25 pituitary. cDNA synthesis was primed with a bipartite 

oligonucleotide whose 3' domain consisted of a homopolymer of 
d(T)9 residues, while the 5' domain comprised a known sequence 
of 14 nucleotides which included a restriction endonuclease 
recognition site. The resulting cDNA was used as substrate 

30 for nested PGR, In both rounds of aii:5)lif ication, the 

downstream primer was as described above. In the first PGR 
reaction, the 5' end of the primer contained a known 
restriction enzyme recognition sequence and the 3' sequence 
was a mixture of 48 tetradecamers representing all possible 

35 coding sequences for the first five amino acids of [Ser^JGnRH. 
(See Powell, J.F.F., et al. Regulatory Peptides, sxibmitted.) 
Hie sequence of the this pool of oligonucleotides was as 
follows: GCAGGAATTGCA A/G CA T/G TGGTGNTA T/G GG (Seq. ID No. 
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27) . The product of this reaction served as substrate for 
nested reactions where the new upstreaLin primer was bipartite. 
The 5' end of this primer also contained a known restriction 
enzyme recognition sequence while the 3 ' end comprised a pool 
5 of 384 tetradecamers representing all possible coding 
combinations for residues 5-8 of the decapeptide. The 
sec[uences of these oligonucleotides are represented in the 
following two sequences: GCACGAATTCTA T/C GGN C/T TNTCNCC 
(Seq. ID No. 28) and GCACGAATTCTA T/C GGN C/T TNAG C/T CC 

10 (Seq. ID No. 29) . 

PCR products were separated by electrophoresis in a 
2% GTG agarose gel (FMC Bioproducts, Rockville, Maine, USA) 
and products >250 bases were electroeluted and subcloned into 
M13 phage vector for sequence analysis (see Bond et al. (1991) 

15 J. Mol. Endocrinol., 5:931-937). Nucleotide sequences were 
analyzed for the presence of the second upstream primer 
followed by the codons for the final amino acid of the 
decapeptide, glycine, and the following three amino acid 
residues gly, lys, arg (see figure 2) , These residues follow 

20 the decapeptide sequence in the other cloned GnRH 

preprohormones and serve as substrates for posttranslational 
processing. Several independent clones from 3 separate 
first* round and 7 second- rotind PCR reactions contained these 
Isindmarks followed by an open reading frame of 201 nucleotides 

25 (67 amino acids) . To confirm the identity of these clones as 
the [Ser®] -GnRH preprohormone, templates were visually 
compared after aligziment in a spreadsheet (Color Alignment 
Macros, M. Haygood, Scripps Institute of Oceanography, San 
Diego, California, USA) and a region of consensus was chosen 

30 from which to generate new primers for 5' rapid amplification 
of cDNA ends (RACE). See Frohman et al. in (1993) Hfethods in 
JSnzymoIo^y Volume 218, Wu, R,, editor. Academic Press, Inc., 
San Diego, California, USA for a more detailed description of 
RACE methodology. 

35 The bipartite primer used to make the substrate cDNA 

for the 5' RACE reactions consisted of a randomly generated 
hexamer including all possible combinations of nucleotides at 
7 positions. These cDNA products were tailed at their 5' ends 
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with dATP in a terminal deoxytransf erase reaction (GIBCO BRL, 
Grand Island, New York, USA) • Two rounds of PGR amplification 
were performed on the tailed cDNA in which the 3' antisense 
primer consisted of a bipartite oligonucleotide (poly d{T)17 
5 plus Bam HI recognition site), designed to anneal to the 5' 
terminal poly d (A) s . The outer 5 ' antisense primer for the 
first round of PGR consisted of 18 nucleotides corresponding 
to a region of high consensus amongst the candidate 
preprohormone templates. The sequence of this primer is 

10 CATATTGCCCAGTGTGTC (Seq. ID No. 30). For the second round of 
amplification, the inner 5* antisense primer was bipartite, 
conposed of a known sequence of 14 nucleotides including a 
restriction endonuclease cleavage site plus another 18 
nucleotides from the preprohormone candidate which were 3' to 

15 but did not overlap with the first selected sequence. The 
nucleotide sequence of this primer is 

GCACGAATTCGTGTCTGAQAAGTTGTCC (Seq. ID No. 31). PGR products 
were prepared for sequencing as described above, and were 
screened for the presence of the innermost primer followed by 
20 antisense sequences from the candidate texnplates and by 
antisense codons for the GnRH decapeptide itself. 

Example 8 ; Sequence Analysis of the cDNA encoding H. burton! 
rSer^l -GnRH preprohormone 

25 Approximately 80% of the generated clones were 

positive for the [Ser®] -GnRH- encoding region and extended 
approximately 140 more nucleotides before ending in the poly 
d(T)/BamHl primer sequence. The fullest sequence of this 
preprohormone was obtained by aligning overlapping regions 

30 from consensus 5* RACE templates with consensus sequences 

generated in the primary PGR amplification. Sequences were 
aligned (Color Alignment Macros) and differences attributed to 
PGR- error were assessed. The sequence of the full length cDNA 
is shown as Seq. ID No. 19 and the deduced amino acid secpience 

35 is shown as Seq. ID. No. 20. In each case, a signal sequence 
precedes one of the decapeptide forms, followed by a 
conserved proteolytic processing site and an associated 
peptide. 
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Example 9: Localization of fSer^l -GnRH preprohormone mRNA 
expression in H. Burtoni brain sections by in situ 
hybridization 

The identity of the [Ser®] -GnRH preprohormone 
5 obtained above as that encoding the GnRH contained within 
preoptic area GnRH-ir neurons which govern gonadotropin 
secretion from the pituitary was confirmed through in situ 
analysis on H. burtoni brain sections. These neurons are 
known to enlarge or shrink in size depending on the social and 

10 reproductive state of the fish (see Davis & Femald (1991) J, 
Neurobiol. 21:1180-1188). Tissue sections were prepared from 
both a dominant and a sxibordinate animal, which contain either 
large or small GnRH immunoreactive (GnRH-ir) neurons, 
respectively. Social status was determined by behavioral 

15 observations. Animals were sacrificed, brains removed and 
tissue fixed as previously described in Example 4, herein. 
Cryostat sections (40/xm) were mounted on poly (L- lysine) -coated 
slides for hybridization. 

A hybridization probe was generated by PGR 

20 amplification of a consensus 5' RACE product cloned into M13 
using lac and rev lac primers. Following gel electrophoresis 
performed as described in Example 7, herein, a single band of 
the correct size (ca. 210 bases) was electroeluted and 
fragments were subcloned into pSK+ (Stratagene, San Diego, 

25 California, USA) . Double- stranded sequencing confirmed that 

several clones contained the 5' RACE insert and one was chosen 
as a template for generation of an antisense digoxygenln-UTP 
(Boehringer Mannheim, Indisuaapolis, Indisma, USA) riboprobe. 
The riboprobe was prepared as follows: The plasmid wae 

30 linearized with BanHI (GIBCO/BRL, Grand Island, New York, USA) 
and purified using the Wizard clean-up system from Promega, 
Madison, Wisconsin, USA) . 300 ng was used for transcription 
of an RNA probe with T7 polymerase (GIBCO/BRL, Grand Island, 
New York, USA) and a IxNTP labelling mixture 

35 (Boehringer-Mannheim, Indianapolis, Indiana, USA) as described 
in Example 4, herein. 

Hybridizations were carried out at high stringency 
{60^C in 60% formamide) under conditions slightly modified 
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from those suggested by the digoxygenin manufacturer 
(Boehringer Mannheim, Indianapolis, Indiana, USA) . 150 /zl of 
hybridization solution was applied to slides containing five 
to six brain sections. After hybridization, 2 washes were 
done in 2x SSC, followed by washes in 1.5x and Ix SSC 
respectively at 60 «C with a final wash at room temperature in 
Ix SSC- All washes were for 20 minutes. Slides were 
dehydrated, coverslipped and viewed using a light microscope. 

Figure 5 shows a panel of midsagittal sections from 
H. burton! brain, focusing on the three regions which contain 
GnRH-ir cell populations (see Davis & Fernald (1990), supra). 
The top three regions were hybridized to the tSer^] -GnRH 
riboprobe as described above. Within this top tier, the first 
section is split, showing preoptic area cells labeled with the 
iSer®] -GnRH riboprobe from both a dominant and a subordinate 
male fish. They reveal that the tSer®] -GnRH transcript is 
expressed in the preoptic region within a cell population that 
exhibits neuroiaal size plasticity correlated with social 
state. The bottom two pemels show hybridization of the 
riboprobe in the terminal nerve area and the mesencephalon 
regions of the H. Jburtoni brain of dominant males. Thus, in 
H. burtoni, the three different genes encoding GnRH are 
expressed in three distinct regions of the brain. 

While not wishing to be bound by theory, the in situ 
hybridization data supports an important reproductive role for 
the hypothalamic -preoptic form of GnRH, [Ser®] -GnRH, in the 
cichlid fish H. burtoni. In this species, preoptic GnRH-ir 
cells grow and shrink with social and reproductive state. 
Dominant males have large GnRH -producing cells, large testes, 
and are reproductively active. Subordinate males have small 
GnRH-producing cells, small testes and do not mate. Preoptic 
GnRH gene expression may therefore not only reflect the 
reproductive state of the ainimal, but may also be responsive 
to social cues. 

Also while not wishing to be bound by theory, the 
distribution of tHis5,Trp'',Tyr®] -GnRH raRNA in H. Burtoni 
brain, coupled with other information, suggests that this foim 
of GnRH may coordinate reproductive behavior with reproductive 
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State. [His5,Trp'^,TyrS] -GnRH is the most widespread and 
ancient GnRH isoform, having been immunologically detected in 
caudal brain regions of all non -mammalian vertebrates studied 
to date, often localized to a midbrain neuronal population. 
While in goldfish these cells project to the median eminence 
and result in the delivery of tHis5,Trp'',Tyr®lGnRH (in 
addition to the hypothalamic form) to the pituitary, this 
isoform is not present in the pituitary of H. burtoni. 
Neither is it widely detected in the pituitaries of other 
species. This fact stands in contrast to the marked efficacy 
of the tHis^,Trp'',Tyr®3GnRH isoform in causing release of 
gonadotropins in cultures of dispersed pituitary cells. Thus, 
while this isoform is capable of acting at pituitary GnRH 
receptors, it seldom has the in vivo opportunity to 
demonstrate its functional potency. However, because of this 
biological activity, nucleic acids encoding the 
[His5,Trp'',Tyr8]GnRH precursor, as well as the nucleic acids 
encoding the [Ser®] -GnRH precursor, may be useful in the 
production of transgenic suiimals, including transgenic fish. 

Example 9: Northern blot analysis o f H. burtioni brair ^-^affv ^ 
for the fSerSl -GnRH Precursor 

To determine the size of the [Ser®] -GnRH mRNA 
precursor relative to the sizes of the other GnRH 
preprohormones in H. burtoni. Northern analyses were performed 
at high stringency as described in Example 3, herein. 
Poly (A) + RMA was isolated from ventral and dorsal brain 
regions using the PastTrak kit from Invitrogen Corporation, 
San Diego, California. 1.5 /lig aliguots were loaded onto 
replicate lanes of a denaturing 2.5% agarose gel for size 
separation. The gel was prepared as a Northern blot using 
capillary transfer. Individual lanes were subjected to 
high- stringency hybridizations (50% formamide, 62«»C) with 
ribqprobes for each of the 3 GnRH preprohormones in H. 
burtoni, either alone and in combination. Ribqprobes were 
synthesized with T7 polymerase (GIBCO BRL, Gramd Islemd, New 
York, USA), incorporating [a-32P]UTP (Amersham Corporation, 
Arlington Heights, Illinois, USA) to a specific activity of 3 
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X 108 dpm/ng. Hybridization mixtures contained 107 dpm per 
ml . The [Ser«] -GnRH riboprobe labeled a band of approximately 
550 bases while the size of the [Trp', Leu®] GnRH and 
[Hiss, Trp7, Tyre ] GnRH transcripts are approximately 500 and 590 
bases, respectively. 

The above exarr^jles are provided to illustrate the 
invention but not to limit its scope. Other variants of the 
invention will be readily apparent to one of ordinary skill in 
the art and are encompassed by the appended claims. All 
pxiblications, patents, and patent applications cited herein 
are hereby incorporated by reference. 
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SECJUENCE LISTING 



(1) GENERAL INFORMATION: 
(i) APPLICANT: 

(A) NA^: The Board of Trustees of Leland Stanford, Jr. 
University, 
And 

State of Oregon, acting by and through The 
Oregon State Board of Higher Education on behalf 
of The Oregon Health Sciences University 



(B) STREET: 900 Welch Road, Suite 350 

(C) CITY: Palo Alto 

(D) STATE: California 

(E) COUNTRY: U.S.A. 

(F) POSTAL CODE (ZIP): 94304-1858 

(G) TELEPHONE: (415) 723-0651 

(H) TELEFAX: (415) 725-7295 

(I) TELEX: 

(ii) TITLE OF INVENTION: Nucleic acids Encoding 

[Hie-5,Try-7,Tyr-8] -GnRH Preprohortnone and 
[Ser-8} -GnRH preprohormone and 
Their Uses 

(iii) NUMBER OF SEQUENCES: 31 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS -DOS 

(D) SOFTVKAPJS: PatentIn Release ^1.0, Version #1.25 

(v) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: WO not yet designated 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(vi) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/147,771 

(B) FILING DATE: 05 -NOV- 1993 

(vii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Weber, Kenneth A. 

(B) KEGISTRATION NUMBER: 31,677 

(C ^^FERENCE /DOCKET NUMBER: 14210-000400PC 

(viii) TE:-.:C0MMCJNICATI0N INFORMATION: 

(A) TELEPHONE: (415) 543-9600 

(B) TELEFAX: (415) 543-5043 



(2) INFORMATION FOR SEQ ID N0:1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5€3 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) KYPOTHEnCAL: NO 
(vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Haplochromis burton! 

(ix) FBATDRE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 135.. 389 



(ix) FEATURE: 

(A) NAME /KEY: mis Cofeature 

(B) LOCATION: 243.-389 

(D) OTHER INFORMATION: /notes "Nucleotide sequence that 
encodes the GAP peptide . ^ 

(ix) FEATURE: 

(A) NAME /KEY: miec^f eature 

(B) LOCATION: 1.-563 

(D) OTHER INFORMATION: /notes= "CDNA sequence that contains 
nucleotide sequence that encodes 
GnRH(5-His,7-Trp, 8-Tyx} from Haplochromis 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 



GACTGTCAGC GCAACTGGAT TTTAGCACTA AATCCACCAA AGGAAAAGAA CATTTTGAAG 60 

TGAACCCCTG CAGGTACACT GAGGGAAACT TGGACTGATA AAGCTGTGAA ATCTAAGACT 120 

AAGGTGGGAA TATC ATG TGT GTG TCT CGA CTG GCT TTG CTC TTG GGG CTG 170 
Met Cys Val Ser Arg Leu Ala Leu Leu Leu Gly Leu 
15 10 

CTT CTC TGT GTG GGG GCT CAG CTG TCC TTT GCC CAG CAC TGG TCC CAT 218 
Leu Leu Cys Val Gly Ala Gin Leu Ser Phe Ala Gin His Trp Ser His 
15 20 25 

GGT TGG TAT OCT GGA GGA AAA AGG GAG CTG GAC TCC TTT GGC ACA TCA 266 
Gly Trp Tyr Pro Gly Gly Lys Arg Glu Leu Asp Ser Phe Gly Thr Ser 

. 30- 40 - 

GAG ATT TCA GAG GAG ATT AAG CTG TGT 6AA GCA GGG GAA TGC AGC TAC 314 
Glu He Ser Glu Glu He Lys Leu Cys Glu Ala Gly Glu Cya Ser Tyr 
45 50 55 60 

CTG AGA CCC CAG AGG AGG AGT ATC CTG AGA AAC ATT CTT CTG GAT GCC 362 
Leu Arg Pro Gin Arg Arg Ser He Leu Arg Asn He Leu Leu Asp Ala 
65 70 75 

TTA GCC AGA GAG CTT CAG AAG AGA AAG TGACATCTTT CCAGAGCCTC 409 
Leu Ala Arg Glu Leu Gin Lys Arg Lys 
80 85 



TTTTCTAXAG TAACCCACTT CCCTTTGTAT TTCTGCCTTG ACGTGATTTT GTGAXCATCT 469 
6GCCTTGCTG TTTGTAATGT TXGTCAGTAA ATTTGTCCT6 TTTTTTTC6A TGTGAAAATT 529 
GTGTCCCAAA ATAAATATCT ATTTTTATAT TAAA 563 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 85 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



wo 95/12309 



61 



PCTAJS94/12763 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO. 2: 

Met Cys Val Ser Arg Leu Ala Leu Leu Leu Gly Leu Leu Leu Cys Val 
15 10 15 

Gly Ala Gin Leu Ser Phe Ala Gin His Trp Ser Hie Gly Trp Tyr Pro 
20 25 30 

Gly Gly Lys Arg Glu Leu Asp Ser Phe Gly Thr Ser Glu lie Ser Glu 
35 40 45 

Glu lie Lys Leu Cys Glu Ala Gly Glu Cys Ser Tyr Leu Arg Pro Gin 
50 55 60 

Arg Arg Ser lie Leu Arg Asn lie Leu Leu Asp Ala Leu Ala Arg Glu 
65 70 75 80 

Leu Gin Lys Arg Lys 
85 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Haplochromis burtoni 

(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1..90 

(D) OTHER INFORMATION: /notes "Name: 
PreproGnRH{7-Trpr 8-Leu} . " 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Met Glu Ala Gly Ser Arg Val lie Met Gin Val Leu Leu Leu Ala Leu 
15 10 15 

Val Val Gin Val Thr Leu Ser Gin His Trp Ser Tyr Gly Trp Leu Pro 
20 25 30 

Gly Gly Lys Arg Ser Val Gly Glu Leu Glu Ala Thr lie Arg Met Met 
35 40 45 

Gly Thr Gly Gly Val Val Ser Leu Pro Asp Glu Ala Asn Ala Gin lie 
50 55 60 

Gin Glu Arg Leu Arg Pro Tyr Asn He He Asn Asp Asp Ser Ser His 
65 70 75 80 

Phe Asp Arg Lys Lys Arg Phe Pro Asn Asn 
85 9' 
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(2) INFORMATION FOR SEQ ID NO:4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: untaown 

(ii) MOLECULE TYPE: protein 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mammal 

(ix) FEATURE: 

(A) NAME /KEY: Peptide 

(B) LOCATION: 1..10 

(D) OTHER INFORMATION: /note= "GnRH« 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 

Glu His Trp Ser Tyr Gly Leu Arg Pro Gly 
15 10 

(2) INFORMATION FOR SEQ ID NO:S: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(v) FRAGMENT TYPE: N* terminal 

(vi) ORIGINAL SOURCE: 

(A) ORGANIi^: Chicken (I) 

(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1..10 

(D) OTHER INFORMATION: /note= "6nRH{8-Gln} " 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:5: 

Glu His Trp Ser Tyr Gly Leu Gin Pro Gly 
15 10 

(2) INFORMATICSN FOR SEQ ID N0:€: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 
(D) TOPOI^Y: un)cnown 

(ii) MOLECULE TYPE: protein 

(v) FRAGMENT TYPE: N* terminal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmon 

(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1..10 

(D) OTHER INFORMATION: /notes "GnRH{7-Trp, 8-Leu} " 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Glu His Trp Ser Tyx Gly Trp Leu Pro Gly 
15 10 

(2) INFORMATION FOR SEQ ID N0:7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Catfish 

(ix) FEATURE: 

(A) NAME /KEY: Peptide 

(B) LOCATION: 1..10 

(D) OTHER INFORMATION: /notes «6nRH{5-Hi8, S-Asn} " 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Glu His Trp Ser His Gly Leu Asn Pro Gly 
15 10 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(v) FRAGMENT TYPE: N- terminal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: CSiicken (II) 

(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1..10 

(D) OTHER INFORMATION: /noterr "GnRH{5-Hi6, 7-Trp, 8-Tyr} « 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Glu His Trp Ser His Gly Trp Tyr Pro Gly 
15 10 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: untoown 

(ii) MOLECULE TYPE: protein 

(V) FRACaiBNT TYPE: N- terminal 

(vi) ORIGINAL SOURCE: 

(A) 0RC3ANISM: Dogfish 

(ix) FEATURE: 

(A) NAME/KEY: Peptide 
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(B) LOCATION: 1. .10 

(D) OTHER INFORMATION: /note= "GnRH{5-His, 7-Trp, 8 -Leu} " 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:9: 

Glu His Trp Ser His Gly Trp Leu Pro Gly 
15 10 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(v) FRAGMENT TYPE: N- terminal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: hampxBy 

(ix) FEATURE: 

(A) NAME /KEY: Peptide 

(B) LOCATION: 1..10 

(D) OTHER INFORMATION: /note^ 

"GnRH{3-Tyr,5-Leu,€-Glu,7-Trp,8-Lys}" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Glu His Tyr Ser Leu Glu Trp Lys Pro Gly 
15 10 

(2) INFORMATION FOR SEQ ID N0:11 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 
(D) TOPOIOGY: unknown 

(ii) MOLECULE TYPE: protein 

(v) FRAGMENT TYPE: N- terminal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Lan^rey 

(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1..10 

(D) OTHER INFORMATION: /note=r 

" GnRH { 3 -Tyr , 5 -His , 6 - Asp , 7 - Trp , 8 - Ly B } " 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Glu His Tyr Ser His Asp Trp Lys Pro Gly 
15 10 

(2) INFORMATION FOR SEQ ID NO: 12: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) liENGTH: 8 amino acide 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(v) FRAGMENT TYPE: N- terminal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Alpha -yeast mating factor 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

His Trp Leu Glu Leu Lys Pro Gly 
1 5 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
CARCAYTG6T CNCA 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
CA7GQNT(3GT AYCC 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENCtTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
(SCAGAAGCTT CA6CT 
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(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (gencxnic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
GGGAATGCAG CTACCTGAGA CCCCAGAGG 
(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 422 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 72.. 338 

(D) OTHER INFORMATION: /products "The 

(Iiis-5,Trp-7,Tyr-8] -GnRH preprohormone from 
Treeshrew" 

(ix) FEATURE: 

(A) NAlffi/KEY: misc_f eature 

(B) LOCATION: 60 . . 62 

(D) OTOER INFORMATION: /notes "Encodes potential second 
initiator methionine." 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

CTCGAGCCAG AAGGAGCCAC CCTACCTATA GCCTCTTCCA CTGCTGTCCC CGTCCTGCCA 60 

TGGCCAGTTC C ATG CTG GGC TTC CTC CTC CTG CTG CTG CTG CTG ATG GCT 110 
Met Leu Gly Phe Leu Leu Leu Leu Leu Leu Leu Met Ala 
15 10 

GCC CAC CCT GGA CCC TCG GAG GCC GAG CAT TGG TCC CAC GGC TGG TAC 158 
Ala Hie Pro Gly Pro Ser Glu Ala Gin His Trp Ser His Gly Trp Tyr 
15 20 25 

CCT GGA GGA AAG CGA GCC TCC AAC TCA CCC CAG GAC CCT CAA A6T GCC 206 
Pro Gly Gly Lys Arg Ala Ser Asn Ser Pro Gin Asp Pro Gin Ser Ala 
30 35 40 45 

err AGG CCC CCA GCC CCC AGC GCA GCC AGA CTG CTC ATA GCT TCC GAA 254 
Leu Arg Pro Pro Ala Pro Ser Ala Ala Arg Leu Leu lie Ala Ser Glu 
50 55 60 

GCG CTG CTC TGG CTT CCC CCG AGG ACA GTG TGC CCT GGG AGG GCA GGA 302 
Ala Leu Leu Trp Leu Pro Pro Arg Thr Val Cye Pro Gly Arg Ala Glv 
65 70 75 
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CCA CAG CAG GAT GGT CTC TCC GCA GGA AGC AC^ ACC TGATGCGGAC 348 
Pro Gin Gin Asp Gly Leu Ser Ala Gly Ser r "hr 
80 85 

ACTGCTGAGC GCAGCCGGAG CGCCGCGCCC CGCCGCCGTv. JCAATAAAGC CGTGAGATTC 408 
CCGAAAAAAA AAAA 422 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 83 amino acids 

(B) TYPE: eunino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

Met Leu Gly Phe Leu Leu Leu Leu Leu Leu Leu Met Ala Ala His Pro 
15 10 15 

Gly Pro Ser Glu Ala Gin His Trp Ser His Gly Trp Tyr Pro Gly Gly 
20 25 30 

Lys Arg Ala Ser Asn Ser Pro Gin Asp Pro Gin Ser Ala Leu Arg Pro 
35 40 45 

Pro Ala Pro Ser Ala Ala Arg Leu Leu lie Ala Ser Glu Ala Leu Leu 
50 55 60 

Trp Leu Pro Pro Arg Thr Val Cys Pro Gly Arg Ala Gly Pro Gin Gin 
65 70 75 80 

Asp Gly Leu Ser Ala Gly Ser Ser Thr 
85 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQX7ENCE CHARACTERISTICS: 

(A) LENGTH: 491 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 71. ,364 

(D) OTHER INFORMATION: /products "The [Ser-8] -GnRH 
preprohormone from H. burtoni" 

(ix) FEATURE: 

(A) NAME/KEY: misc feature 

(B) LOCATION: 47.. 49 

(D) OTHER INFORMATION: /note= "Encodes potential second 
initiator methionine." 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
TTAGACTTCA CAAAG6ACAG CAGAAGAGAT CAGAAGTTCT TGTCTAATGC ACAGAAGCTT €0 
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TATCCTCAGA ATG GCT GCA AAA ATC TTG GCA CTG TGG CTG CTG CTC GCA 109 
Met Ala Ala Lys lie Leu Ala Leu Trp Leu Leu Leu Ala 
15 10 

GGG ACG GTG TTT CCA CAG GGC TGC TGT CAG CAC TGG TCA TAC GGA CTG 157 
Gly Thr Val Phe Pro Gin Gly Cys Cys Gin His Trp Ser Tyr Gly Leu 
15 20 25 

AGC CCA GGA GGG AAG AGG GAT CTG GAC AAC TPC TCA GAC ACA CTG GGC 205 
Ser Pro Gly Gly Lys Arg Asp Leu Asp Asn Phe Ser Asp Thr Leu Gly 
30 35 40 45 

AAT ATG GTT GAA GAG TTC CCA CGC GTC GAA GCA CCT TGC AGT GTT TTC 253 
Asn Met Val Glu Glu Phe Pro Arg Val Glu Ala Pro Cys Ser Val Phe 
50 55 60 

GGT TGT GCA GAG GAA TCA CCT TTT GCC AAA ATG TAC AGA GTG AAA GGA 301 
Gly Cys Ala Glu Glu Ser Pro Phe Ala Lys Met Tyr Arg Val Lys Gly 
65 70 75 

CTT CTT GCG AGT GTG GCC GAA AGG AAA ATG GAC ACC GGA CAT TCA AGA 349 
Leu Leu Ala Ser Val Ala Glu Arg Lys Met Asp Thr Gly His Ser Arg 
80 85 90 

AAT GAA AGA TIT CTT TGATTCTACA TTTCATTTTT TATATGAGCA TAAAACATTT 404 
Asn Glu Arg Phe Leu 
95 

TTTTGTGAAT GTTGCTCTTG TCTTATTATC TAAAATATAA ATAAAAGCTT TCAACTCACT 464 
GAAAAAAAAA AAAAAAAAAA AAAAACC 491 



(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 98 aihiho 

(B) TY'PE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

Met Ala Ala Lys lie Leu Ala Leu Trp Leu Leu Leu Ala Gly Thr Val 
15 10 15 

Phe Pro Gin Gly Cys Cys Gin His Trp Ser Tyr Gly Leu Ser Pro Gly 
20 25 30 

Gly Lys Arg Asp Leu Asp Asn Phe Ser Asp Thr Leu Gly Asn Met Val 
35 40 45 

Glu Glu Phe Pro Arg Val Glu Ala Pro Cys Ser Val Phe Gly Cys Ala 
50 55 60 

Glu Glu Ser Pro Phe Ala Lys Met Tyr Arg Val Lys Gly Leu Leu Ala 
65 70 . 75 80 

Ser Val Ala Glu Arg Lys iiet Asp Thr Gly His Ser Arg Asn Glu Ara 
85 90 95 



Phe Leu 
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(2) INFORM/ITXON FOR SEQ ID NO: 21: 

(i) SEQqENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNES S : 6 ingl e 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (primer) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21 
GCACGAATTC CARCAYTGGT CNCA 
(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDBDNESS : single 

(D) TOPOLOGY: linear 

(ii) ISOLBCOLE TYPE: DNA (primer) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22 
GCACGAATTC CARCAYTGGA GYCA 
(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (primer) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23 
GCACGAATTC CAYGGNTGGT AYCC 
(2) INFORMATION FOR SEQ ID NO: .4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) HOLBCm^B TYPE: DNA (primer) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24 
CAGGGCACAC T6TCCTC 
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(2) INFORSCITION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DKA (primer) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:25 
GCACGAATTC CGAAGCTATG AGCAGTC 
(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (primer) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26 
GCACGAATTC GGTCCCACGG CTGGTAC 
(2) INFORMATION FOR SEQ IC NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) 2K>IiECULE TYPE: DNA (primer) 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:27 
GCACGAAITC CARCAYTGCrT CNTAYGG 
(2) INFORMATION FOR SEQ ID NO:28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) laN QTH; 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDl^SS : single 

(D) TOPOLOGY: linear 

(ii) l^LECULE TYPE: DNA (primer) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28 
GCACGAATTC TAYGGNYTNT CNCC 



wo 95/12309 



71 

(2) INFORHATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (primer) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29 
GCACGAATTC TAYGGNYTNA GYCC 
(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (primer) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30 
CATATTGCCC AGTGTCjTC 
(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) ^LECULE TYPE: DNA (primer) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31 
GCACGAATTC GTGTCTGAGA AGTT6TCC 
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WHAT IS CIATMED Tfi . 

1 1. An isolated nucleic acid encoding a vertebrate 

2 [His^,Trp'^,Tyr^] -GnRH preprohormone . 

1 2 . A composition according to claim 1 wherein said 

2 nucleic acid encodes a full-length [His^, Trp'^, Tyr®] -GnRH 

3 preprohormone . 

1 3. A composition according to claim 1 wherein said 

2 preprohormone consists of the amino acid sequence depicted in 

3 Seq. ID No. 2. 

1 4, A composition according to claim 1 wherein said 

2 preprohormone is of fish origin, 

1 5, An isolated nucleic acid encoding a vertebrate 

2 [His^,Trp'^,Tyr®l -GnRH preprohormone GAP peptide. 

1 6. A composition according to claim 5 wherein said 

2 nucleic acid encodes a full-length [His^/Trp'',Tyr®] -GnRH 

3 preprohormone GAP peptide. 



1 7. A composition according to claim 6 wherein said 

2 nucleic acid encodes the GAP peptide as shown in Figure 2. 

1 8. An isolated vertebrate [His^^Trp'^^Tyr®] -GnRH 

2 preprohormone . 



1 9* A composition according to claim 8 wherein said 

2 preprohormone is specifically immunoreactive with antibodies 

3 raised against an imraunogen consisting essentially of a 

4 polypeptide of Seq ID No. 2. 

1 10. A composition according to claim 8 wherein said 

2 preprohoimone is recombinantly produced. 
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1 11. A composition according to claim 8 wherein said 

2 preprohormone is of fish origin. 

1 12. A composition according to claim 8 wherein said 

2 preprohormone is full-length. 

1 13. An isolated vertebrate [His^,Trp'^,Tyr®] -GnRH 

2 preprohormone GAP peptide. 

1 14. A composition according to claim 13 wherein 

2 said GAP peptide is competent to bind antibodies raised 

3 against an immunogen consisting essentially of a GAP peptide 

4 as shown in Figure 2* 

1 15 . A composition according to claim 13 wherein 

2 said GAP peptide is recombinant ly produced. 

1 16. A composition according to claim 13 wherein 

2 said GAP peptide is of fish origin. 

1 17. A corrposition according to claim 13 wherein 

2 said GAP peptide is full-length. 

1 18. An antibody specifically immunoreactive with a 

2 vertebrate [His^ , Trp"^ , Tyr® ] - GnRH preprohormone . 

1 19. A composition according to claim 18 wherein 

2 said antibody is specifically immunoreactive with a vertebrate 

3 lHis^,Trp'',Tyr®] -GnRH preprohormone GAP peptide. 

1 20. A nucleic acid probe capable of selectively 

2 hybridizing to a nucleic acid encoding a vertebrate GnRH 

3 preprohormone. 

1 21. The composition of claim 20 wherein said 

2 nucleic acid consists of Seg. ID No. 1. 
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22. The composition of claim 20 wherein said 
nucleic acid consists of a GAP peptide cDNA sequence as shown 
in Seq. ID No. l. 

23. A transgenic non-human animal having increased 
reproductive capacity due to the expression of a DNA construct 
encoding a vertebrate [His^,Trp'^,Tyr®] -GnRH preprohormone 
introduced into said animal or an ancestor of said animal • 

24. The non-human animal of claim 23 wherein said 
animal is a fish. 

25. A method of detecting a vertebrate 
[His^,Trp*',Tyr®] -GnRH preprohormone in a biological sanple 
conprising the steps of: 

a) contacting said biological sairple with an 
antibody specifically dLmmunoreactive with a vertebrate 
[His^,Trp*^,Tyr®] -GnRH preprohormone; 

b) inciibating said antibody with said biological 
sample to form an antibody: [His^,Trp'',Tyr®] -GnRH preprohormone 
conqplex ; aind ^ 

c) detecting said coxnplex. 

26. A method according to claim 25 wherein said 
biological sample is of fish origin. 

27. A method of detecting a nucleic acid encoding 
a vertebrate [His^,Trp*^,Tyr®] -GnRH preprohormone in a 
biological saxitple comprising: 

a) contacting said biological sample with a nucleic 
acid probe capable of selectively hybridizing to said nucleic 
acid encoding a vertebrate [His^,Trp'^,Tyr®] -GnRH 
preprohormone; 

b) incubating said nucleic acid probe with the 
biological sample to form a hybrid of the nucleic acid probe 
with complementary nucleic acid sequences present in the 
biological sample; and 
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12 c) determining the extent of hybridization of the 

13 n.:cleic acid probe to the complementary nucleic acid 

14 sequences . 

1 28. A method according to claim 27 wherein said 

2 biological sample is of fish origin. 

1 29. A con5)Osition according to claim 1 wherein said 

2 preprohormone is of mammalian origin. 

1 30. A conposition according to claim 1 wherein said 

2 preprohormone consists of the amino acid sequence depicted in 

3 Seq, ID No. 18 • 

1 31. A composition according to claim 8 wherein said 

2 preprohormone is of mammalian origin. 

1 32 . A coirposition according to claim 8 wherein said 

2 preprohormone consists of the amino acid sequence depicted in 

3 Seq. ID No. 18. 

1 33. An isolated nucleic acid encoding a vertebrate 

2 [Ser®] -GnRH preprohormone. 

1 34. A coicposition according to claim 33 wherein 

2 said preprohormone is of fish origin. 

1 35. A coxiposition according to claim 33 wherein 

2 said preprohormone consists of the amino acid sequence 

3 depicted in Seq. ID No. 20 « 

1 36. An isolated vertebrate [Ser®] -GnRH 

2 preprohormone . 

1 37. A composition according to claim 36 wherein 

2 said preprohormone is of fish origin. 
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38. A composition according to claim 36 wherein 
said preprohormone consists of the amino acid sequence 
depicted in Seq. ID No. 20. 
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Flg.2: 

-20 -10 +1 10 

GnRH{7Trp8Leu): MEAGSRVIMQVLLLALWQVTLS QHWSYGWLPG GKR 

GnRH{5His7Trp8Tyr}: MCVSRLALLLGLLLCVGAQLSFA QHWSHGWYPG GKR 

20 30 40 50 60 

SVGELEATIRMMGTGGWSLPDEANAQIQERLRPYNIINDDSSHFDRKKRFPNN* 
ELDSFGTSEISEEIKLCEAGECSYLRPflEESILRNILLDALARELQKRK* 
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